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(57) Abstract 

A polyketide synthase ('PKS") of Type I is a complex multienzyme including a loading domain linked to a multiplicity of extension 
domains. Ths first extension module receives an acyl starter unit from the loading domain and each extension module adds a further ketide 
unit which may undergo processing (e.g. reduction). We have found that the Ksq domain possessed by some PKS's has decarboxylating 
activity, e.g. generating (substituted) acyl from (substituted) malonyl. The CLF domain of type II PKS's has similar activity. By inserting 
loading modules including such domains into PKS's not normally possessing them it is possible to control the starter units used. 
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Polvketides and their Synthesis 

The present invention relates to processes and 
materials (including enzyme systems, nucleic acids, vectors 
5 and cultures) for preparing 14-membered macrolides by 
recombinant synthesis and to the novel polyketides so 
produced. Polyketide biosynthetic genes or portions of 
them, which may be derived from different polyketide 
biosynthetic gene clusters are manipulated to allow the 

10 production of specific novel polyketides, such as 12-, 14- 
and 16-membered macrolides, of predicted structure. This 
invention is particularly concerned with the replacement of 
genetic material encoding the natural starter unit with 
other genes in order to prepare 14-membered macrolides with 

15 preferentially an acetate starter unit, whilst minimising 

the formation of by-products containing a dif f erent^starter 
unit . 

Polyketides are a large and structurally diverse class 
of natural products that includes many compounds possessing 
20 antibiotic or other pharmacological properties, such as 
erythromycin, tetracyclines, rapamycin, avermectin, 
monensin, epothilones and FK506. In particular, 
polyketides are abundantly produced by Streptomyces and 

related actinomycete bacteria. They are synthesised by the 
25 repeated stepwise condensation of acylthioesters in a 

manner analogous to that of fatty acid biosynthesis . The 
greater structural diversity found among natural 
polyketides arises from the selection of (usually) acetate 
or propionate as starter or "extender" units; and from the 
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differing degree of processing of the p-keto group 
observed after each condensation. Examples of processing 
steps include reduction to p-hydroxyacyl- , reduction 
followed by dehydration to 2-enoyl-, and complete reduction 
5 to the saturated acylthioester . The stereochemical outcome 
of these processing steps is also specified for each cycle 
of chain extension. 

The biosynthesis of polyketides is initiated by a 
group of chain- forming enzymes known as polyketide 

10 synthases. Two classes of polyketide synthase (PKS) have 
been described in actinomycetes . One class, named Type I 
PKSs, represented by the PKSs for the macrolides 
erythromycin, oleandomycin, avermectin and rapamycin, 
consists of a different set or "module" of enzymes for each 

15 cycle of polyketide chain extension. For an example see 
Figure 1 (Cortes, J. et al. Nature (1990) 348:176-178; 

Donadio, S. et al. Science (1991) 2523:675-679; Swan, D.G. 

et al. Mol. Gen. Genet. (1994) 242:358-362; MacNeil, D. J. 

et al. Gene (1992) 115:119-125; Schwecke, T. et al. Proc, 

20 Natl. Acad. Sci. USA (1995) 92:7839-7843). 

The term "extension module" as used herein refers to 
the set of contiguous domains, from a P-ketoacyl-ACP 
synthase ("AKS") domain to the next acyl carrier protein 
(*'ACP") domain, which accomplishes one cycle of polyketide 

25 chain extension. The term "loading module" is used to 

refer to any group of contiguous domains which accomplishes 
the loading of the starter unit onto the PKS and thus 
renders it available to the KS domain of the first 
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extension module. The length of polyketide formed has been 
altered, in the case of erythromycin biosynthesis, by 
specific relocation using genetic engineering of the 
enzymatic domain of the erythromycin-producing_PKS that 
5 contains the chain releasing thioesterase/cyclase activity 
(Cort6s et al , Science (1995) 268:1487-1489; Kao, CM. et 

al. J. Am. Chem. Soc. (1995) 117:9105-9106), 

In- frame deletion of the DNA encoding part of the 
ketoreductase domain in module 5 of the erythromycin- 
10 producing PKS (also known as 6-deoxyerythronolide B 

synthase, DEBS) has been shown to lead to the formation of 
erythromycin analogues 5, 6 -dideoxy-3 -a-mycarosyl-5 - 
oxoerythronolide B, 5 , 6-dideoxy-5-oxoei:ythronolide B and 
5,6-dideoxy, 6 p-epoxy-5-oxoerythronolide B (Donadio, S. et 

15 al. Science (1991) 252:675-679) , Likewise, alteration of 

active site residues in the enoylreductase domain of module 
4 in DEBS, by genetic engineering of the corresponding PKS- 
encoding DNA and its introduction into Saccharopolyspora 

erythraea, led to the production of 6 , 7-anhydroerythromycin 
20 C (Donadio, S. et al. Proc Natl. Acad. Sci. USA (1993) 

90 :7119-7123) . 

International Patent Application number WO 93/13663 

describes additional types of genetic manipulation of the 

DEBS genes that are capable of producing altered 
25 polyketides . However many such attempts are reported to 

have been unproductive (Hutchinson, C. R. and Fujii, I. 

Annu . Rev. Microbiol, (1995) 49:201-238; at p. 231). The 

complete DNA sequence of the genes from Streptomyces 
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hygroscopicus that encode the modular Type I PKS governing 

the biosynthesis of the macrocyclic itnmunosuppressant 
polyketide rapatnycin has been disclosed (Schwecke, T. et 

al. (1995) Proc. Natl. Acad. Sci, USA 92:7839-7843). The 

DNA sequence is deposited in the EMBL/Genbank Database 
under the accession number X86780. 

The second class of PKS, named Type II PKSs, is 
represented by the synthases for aromatic compounds . Type 
II PKSs contain only a single set of enzymatic activities 
for chain extension and these are re-used as appropriate in 
successive cycles (Bibb, M. J. et al . EMBO J. (1989) 

8:2727-2736; Sherman, D. H. et al . EMBO J. (1989) 8:2717- 

2725; Fernandez -Moreno, M.A. et al . J. Biol. Chem. (1992) 

267:19278-19290) . The "extender" units for the Type II 
pKSs are usually acetate units, and the presence of 
specific cyclases dictates the preferred pathway for 
cyclisation of the completed chain into an aromatic product 
(Hutchinson, C. R. and Fujii, I. Annu. Rev. Microbiol. 
(1995) 49:201-238) . Hybrid polyketides have been obtained 
by the introduction of clones Type II PKS gene- containing 
DNA into another strain containing a different Type II PKS 
gene cluster, for example by introduction of DNA derived 
from the gene cluster for actinorhodin, a blue-pigmented 
polyketide from Streptomyces coelicolor, into an 
anthraquinone polyketide-producing strain of Streptomyces 
ga.lileus (Bartel, P. L. et al . J. Bacteriol . (1990) 

172 :4816-4826) . 

The minimal number of domains required for polyketide 
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chain extension on a Type II PKS when expressed in a 
Streptomyces coelicolor host cell (the "minimal PKS") has 

been defined for example in International Patent 
Application Number WO 95/08548 as containing the following 
three polypeptides which are products of the act J genes : 

first KS; secondly a polypeptide termed the CLF with end- 
to-end amino acid sequence similarity to the KS but in 
which the essential active site residue of the KS, namely a 
cysteine residue, is substituted either by a glutamine 
residue, or in the case of the PKS for a spore pigment such 
as the whiE gene product (Chater, K. F. and Davis, N. K. 

Mol. Microbiol. (1990) 4:1679-1691) by a glutamic acid 
residue (Figure 2) ; and finally an ACP, The CLF has been 
stated for example in International Patent Application 
Number WO 95/08548 to be a factor that determines the chain 
length of the polyketide chain that is produced by the 
minimal PKS. However it has been found (Shen, B. et al . J. 

Am. Chem. Soc . (1995) 117:6811-6821) that when the CLF for 
the octaketide actinorhodin is used to replace the CLF for 
the decaketide tetracenomycin in host cells of Streptomyces 

g^laucescens , the polyketide product is not found to be 

altered from a decaketide to an octaketide, so the exact 
role of the CLF remains unclear. An alternative 
nomenclature has been proposed in which KS is designated 
KSa and CLF is designated KSp, to reflect this lack of 
knowledge (Meurer, G. et al . Chemistry and Biology (1997) 

4:433-443) . The mechanism by which acetate starter units 
and acetate extender units are loaded onto the Type II PKS 
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is not known, but it is speculated that the malonyl-CoA: 
ACP acyltransf erase of the fatty acid synthase of the host 
cell can fulfil the same function for the Type II PKS 
(Revill, W. P, et al. J. Bacterid. (1995) 177 :J946-3952 ) . 

5 International Patent Application Number WO 95/08548 

describes the replacement of actinorhodin PKS genes by 
heterologous DNA from other Type II PKS gene clusters, to 
obtain hybrid polyketides . The same International Patent 
Application WO 95/08548 describes the construction of a 
10 strain of Streptomyces coelicolor which substantially lacks 

the native gene cluster for actinorhodin, and the use in 
that strain of a plasmid vector pRM5 derived from the low- 
copy number vector SCP2* isolated from StrGptomyces 

coGlicolor (Bibb, J. and Hopwood, D. A. J. Gen. 

15 Microbiol. (1981) 126:427) and in which heterologous PKS- 
encoding DNA may be expressed under the control of the 
divergent act 1/ act III promoter region of the 

actinorhodin gene cluster (Fernandez -Moreno, M.A. et al . J. 

Biol. Chem. (1992) 267:19278-19290). The plasmid pRM5 also 
20 contains DNA from the actinorhodin biosynthetic gene 

cluster encoding the gene for a specific activator protein, 
ActII-orf4. The Act II-orf4 protein is required for 
transcription of the genes placed under the control of the 
acti/ act II bidirectional promoter and activates gene 
25 expression during the transition from growth to stationary 
phase in the vegetative mycelium (Hallam, S. E, et al . Gene 

(1988) 74 : 305-320) . 

Type II clusters, in Streptomyces are known to be 
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activated by pathway- specif ic activator genes (Narva, K. E, 
and Feitelson, J. S. J, Bacteriol . (1990) 172:326-333; 
Stutzman-Engwall, K. J, et al . J, Bacteriol . (1992) 

174:144-154; Fernandez -Moreno, M.A, et al . Cell^ (1991) 

5 66:769-780; Takano, E. et al , Mol . Microbiol. (1992) 

6:2797-2804; Takano, E. et al . Mol, Microbiol. (1992) 

7:837-845), The DnrI gene product complements a mutation 
in the actJJ-orf4 gene of S. coellcolor, implying that DnrI 

and ActII-orf4 proteins act on similar targets. A gene 
10 {srmR) has been described (EP 0 524 832 A2) that is located 

near the Type I PKS gene cluster for the macrolide 
polyketide spiramycin. This gene specifically activates 
the production of the macrolide antibiotic spiramycin, but 
no other examples have been found of such a gene. Also, no 

15 homologues of the Actll-orf 4/DnrI/RedD family of activators 
have been described that act on Type I PKS genes . 

Although large numbers of therapeutically important 
polyketides have been identified, there remains a need to 
obtain novel polyketides that have enhanced properties or 

20 possess completely novel bioactivity. The complex 

polyketides produced by Type I PKSs are particularly 
valuable, in that they include compounds with known utility 
as anthelminthics , insecticides , immunosuppressants , 
antifungal or antibacterial agents . Because of their 

25 structural complexity, such novel polyketides are not 
readily obtainable by total chemical synthesis, or by 
chemical modifications of known polyketides. 



BNSDOCID: <WO 0000500A2_L> 



wo 00/00500 



8 



PCT/GB99/02042 



There is also a need to develop reliable and specific 
ways of deploying individual modules in practice so that 
all, or a large fraction, of hybrid PKS genes that are 
constructed, are viable and produce the desired polyketide 
5 product , 

Pending International Patent Application number 
PCT/GB97/01819 discloses that a PKS gene assembly 
(particularly of Type I) encodes a loading module which is 
followed by at least one extension module. Thus Figure 1 

10 shows the organisation of the DEBS genes. The first open 
reading frame encodes the first multi-enzyme or cassette 
(DEBS 1) which consists of three modules: the loading 
module (ery-load) and two extension modules (modules 1 and 
2) . The loading module comprises an acyltransf erase and an 

15 acyl carrier protein. This may be contrasted with Fig. 1 
of WO 93/13663 (referred to above) . This shows ORFl to 
consist of only two modules, the first of which is in fact 
both the loading module and the first extension module. 
PCT/GB97/01819 describes in general terms the 

20 production of a hybrid PKS gene assembly comprising a 
loading module and at least one extension module, 
PCT/GB97/01819 also describes (see also Marsden, A. F. A. 
et al . Science (1998) 279:199-202) construction of a hybrid 

PKS gene assembly by grafting the wide- specif icity loading 
25 module for the avermectin-producing polyketide synthase 
onto the first multienzyme component (DEBS 1) for the 
erythromycin PKS in place of the normal loading module. 
Certain novel polyketides can be prepared using the hybrid 
PKS gene assembly, as described for example in pending 
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International Patent Application number (PCT/GB97/01810) , 
Patent Application PCT/GB97/01819 further describes the 
construction of a hybrid PKS gene assembly by grafting the 
loading module for the rapamycin-producing polyketide 
5 synthase onto the first multienzyme component (DEBS 1) for 
the erythromycin PKS in place of the normal loading module. 
The loading module of the rapamycin PKS differs from the 
loading modules of DEBS and the avermectin PKS in that it 
comprises a CoA ligase domain, an enoylreductase ("ER") 

10 domain and an ACP, so that suitable organic acids including 
the natural starter unit 3 , 4-dihydroxycyclohexane 
carboxylic acid may be activated in situ on the PKS loading 
domain, and with or without reduction by the ER domain 
transferred to the ACP for intramolecular loading of the KS 

15 of extension module 1 (Schwecke, T. et al. Proc . Natl. 

Acad. Sci. USA (1995) 92:7839-7843). 

The DNA sequences have been disclosed for several Type 
I PKS gene clusters that govern the production of 16- 
membered macrolide polyketides, including the tylosin PKS 
20 from Streptomyces fradiae (EP 0 791 655 A2) , the 

niddamycin PKS from Streptomyces caelestis (Kavakas, S. J. 

et al. J. Bacteriol. (1998) 179:7515-7522) and the 

spiramycin PKS from Streptomyces amhofaciens (EP 0791 655 

A2 ) . All of these gene sequences have in common that they 
25 show the loading module of the PKS to differ from the 

loading module of DEBS and of the avermectin PKS in that 
they consist of a domain resembling the KS domains of the 
extension modules, an AT domain and an ACP (Figure 3) . The 
additional N- terminal KS-like domain has been named KSq 
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because it differs in each case from an extension KS by the 
specific replacement of the active site cysteine residue 
essential for P-ketoacyl-ACP synthase activity by a 
glutamine (Q in single letter notation) residua^ The 
function of the KSq domain is unknown (Kavakas, S.J. et 

al. J. Bacterid. (1998) 179:7515-7522), but its presence 

in these PKSs for 16-membered macrolides is surprising 
because the starter units of tylosin, niddamycin and 
spiramycin appear to be propionate, acetate and acetate 
respectively, that is, the same type of starter unit as in 
DEBS. The AT adjacent to the KSq domain is named here the 
ATq domain. 

When the entire loading module of the tylosin PKS was 
used to replace the analogous loading module in the 
spiramycin PKS in S. ambofaciens (Kuhstoss et al. Gene 

(1996) 183:231-236), the nature of the starting unit was 
stated to be altered from acetate to propionate. Since the 
role of the KSq domain was not understood, no specific 
disclosure was made that revealed either the importance of 
the KSq domain, or the possible utility of these KSq- 
containing loading modules in ensuring the purity of the 
polyketide product in respect of the starter unit, even at 
high levels of macrolide production. The interpretation for 
their results was stated as : "Therefore we believe that the 
experiments described here provide strong experimental 
support for the hypothesis that the AT domains in Type I 
PKS systems select the appropriate substrate at each step 
in synthesis" (Kuhstoss et al . Gene (1996) 183:231-236, at 

p. 235) . These authors noted the analogy with the CLF 
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protein in Type II PKS systems and that the latter protein 
is thought to be involved in determining the chain length. 
They state: "KSq may serve a similar function, although it 
is unclear why such a function would be necessary in the 
5 synthesis of these 16-membered polyketides when it is not 
needed for the synthesis of other complex polyketides such 
as 6 -DEB or rapamycin. In any case the KSq is unlikely to 
be involved in substrate choice at each step of synthesis . " 
(Kuhstoss et al. Gene (1996) 183:231-236). 

10 It has been shown that when genetic engineering is 

used to remove the loading module of DEBS, the resulting 
truncated DEBS in Sacch. erythraea continues to produce low 

levels of erythromycins containing a propionate starter 
unit (Pereda, A. et al , Microbiology (1995) 144:543-553) . 

15 The same publication shows that when in this truncated DEBS 
the methylmalonyl-CoA -specific AT of extension module 1 
was replaced by a malonyl-CoA- specif ic AT from an extension 
module of the rapamycin PKS, the products were also low 
levels of erythromycins containing a propionate starter 

20 unit, demonstrating that the origin of the starter units is 
not decarboxylation of the (methyl )malonyl groups loaded 
onto the enzyme by the AT of module 1, but from direct 
acylation of the KS of extension module 1 by propionyl-CoA . 
This is in contrast to a previous report, using partially 

25 purified DEBSl+TE, a truncated bimodular PKS derived from 
DEBS (Kao, C. M. et al . J. Am. Chem. Soc . (1995) 117:9105- 
9106) and functionally equivalent to DEBSl-TE (Brown, M. J. 
B. et al , , J. Chem. Soc. Chem. Commun . (1995) 1517-1518/ 
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Cortes, J. et al . Science (1991) 2523:675-679), which 

stated that the origins of the starter units for DEBS can 
include methylmalonate units which are loaded onto module 1 
and are decarboxylated by the KS of module 1 (Pieper, R. et 

5 al. Biochemistry (1997) 36:1846-1851). It has now been 

found that when the DEBSl-TE protein is fully purified from 
extracts of recombinant Sacch, erythraea it contains no 

such specific decarboxylase activity (Weissmann, K . et al . 

(1998) Biochemistry, 37., 11012-11017), further confirming 
10 that starter units do not in fact arise from 

decarboxylation of extension units mediated by the KS of 

extension module 1 . 

It is known that the DEBS loading module has a 

slightly broader specificity than propionate only, and in 
15 particular acetate starter units are used both in vitro and 

in vivo, when the PKS containing this loading module is 

part of a PKS that is expressed either in Sacch. erythraea 

the natural host for erythromycin production (see for 
example Cortes, J. et al. Science (1995) 268:1487-1489), or 

20 in an heterologous host such as S. coelicolor (Kao, C. M. 

et al, J. Am. Chetn. Soc. (1994) 116:11612-11613; Brown, M. 

J. B. et al . J. Chem. Soc. Chem. Commun. (1995) 1517-1519), 

In vitro experiments using purified DEBSl-TE have 
demonstrated that propionyl-CoA and acetyl-CoA are 
25 alternative substrates that efficiently supply propionate 
and acetate units respectively to the loading module 
(Wiessmann, K. E. H. et al , Chemistry and Biology (1995) 
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2:583-589; Pieper, R. et al . J. Am. Chem. Soc. (1995) 

117:11373-11374). The outcome of the competition between 
acetate and propionate starter units is influenced by the 
respective intracellular concentrations of propionyl-CoA 
and acetyl-CoA prevailing in the host cell used (see for 
example Kao, C. M. et al . Science (1994) 265:509-512; 

Pereda, A. et al . Microbiology (1995) 144:543-553). It is 

also determined by the level of expression of the host PKS, 
so that as disclosed for example in Pending International 
Patent Application number PCT/GB97/01819 , when recombinant. 
DEBS or another hybrid PKS containing the DEBS loading 
module is over -expressed in Sacch. erythraea, the products 

are generally mixtures whose components differ only in the 
presence of either an acetate or a propionate starter unit. 

There is a need to develop reliable methods for 
avoiding the formation of mixtures of polyke tides with both 
acetate and propionate starter units, and to allow the 
specific incorporation of unusual starter units. It has now 
been found, surprisingly, that the role of the loading 
domains in the PKSs for the 16-membered macrolides tylosin, 
niddamycin and spiramycin is different from that of the 
loading domains of the avermectin PKS and of DEBS. It has 
been realised that the KSq domain of the tylosin PKS and 
the associated AT domain, which is named here ATq, together 
are responsible for the highly specific production of 
propionate starter units because the ATq is specific for 
the loading of methylmalonyl-CoA and not propionyl -CoA as 
previously thought; and the KSq is responsible for the 
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highly specific decarboxylation of the enzyme -bound 
methylmalonate unit to form propionate unit attached to the 
ACP domain of the loading module and appropriately placed 
to be transferred to the KS of extension modul®. 1 for the 
initiation of chain extension. In a like manner the ATq of 
the spiramycin and niddamycin PKSs, and the adjacent KSq, 
are responsible for the specific loading of malonate units 
rather than acetate units as previously believed, and for 
their subsequent specific decarboxylation to provide 
acetate starter units for polyketide chain extension. 

It has also now been found here that not only the PKSs 
for the above-mentioned 16-membered macrolides, but also 
the PKSs for certain 14-membered macrolides particularly 
the oleandomycin PKS from StreptomycGS a^ntihioticus (Figure 

4) and also the PKSs for certain polyether ionophore 
polyketides particularly the putative monensin PKS from 
Streptomyces cinnamonensis (Figure 4) , possess a loading 

domain comprising a KSq domain, an ATq domain, and an ACP. 
In Figure 4 is shown a sequence alignment of the KSq 
domains and of the adjacent linked ATq domains that have 
been identified, showing the conserved active site 
glutamine (Q) residue in the KSq domains, and an arginine 
residue which is conserved in all extension AT domains and 
is also completely conserved in ATq domains. This residue 
is characteristically not arginine in the AT domains of 
either DEBS or of the avermectin PKS loading modules, where 
the substrate for the AT is a non- carboxylated acyl-CoA 
ester (Haydock, S. F. et al . FEES Letters (1995) 374:246- 

248) . The abbreviation ATq is used here to simply to 
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distinguish the AT domains found immediately C-terminal of 
Ksq from extension ATs, and the label has no other 
significance . 

In one aspect this invention provides a PKS 
multienzyme or part thereof, or nucleic acid (generally 
DNA) encoding it, said multienzyme or part comprising a 
loading module and a plurality of extension modules for the 
generation of novel, 14-membered macrolides wherein 

(a) the loading module is adapted to load a malonyl 
residue and then to effect decarboxylation of the loaded 
residue to provide an acetyl residue for transfer to an 
extension module; and 

(b) the extension modules, or at least one thereof 
(preferably at least the one adjacent the loading module) , 
are not naturally associated with a loading module that 
effects decarboxylation of an optionally substituted 
malonyl residue. 

Generally the loading module will also include an ACP 
(acyl carrier protein) domain. 

Preferably the decarboxylating functionality of the 
loading module is provided by a KS (ketosynthase) -type 
domain. Suitably this differs from a KS of a conventional 
extension module by possessing a glutamine residue in place 
of the essential cysteine residue in the active site. It 
is termed Ksq. It may be "natural" or genetically 
engineered, e.g. resulting from site-directed mutagenesis 
of nucleic acid encoding a different KS such as a KS of an 
extension module. 
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Alternatively the decarboxylating functionality can be 
provided by a CLF-type domain of the general type occur ing 
in Type II PKS systems. 

Preferably the loading functionality is provided by an 
5 AT (acyltransf erase) -type domain which resembles an AT 
domain of a conventional extension module in having an 
arginine residue in the active site, which is not the case 
with the AT domains of loader modules which load acetate or 
propionate, e.g. in DEBS or avermectin PKS systems. It may 
10 be termed Atq. Once again, it may be "natural" or 

genetically engineered, e.g. by mutagenesis of an AT of an 
extension module. 

Usually the loading module will be of the form: 

Ksq-ATq-ACP 
15 where ACP is acyl carrier protein. 

In another aspect the invention provides a method of 
synthesising novel, 14-membered polyketides having 
substantially exclusively a desired acetate starter unit by 
providing a PKS multienzyme incorporating a loading module 
20 as defined above which specifically provides the desired 

acetate starter unit. This may comprise providing nucleic 
acid encoding the. multienzyme and introducing it into an 
organism where it can be expressed. 

In further aspects the invention provides vectors and 
25 transformant organisms and cultures containing nucleic acid 
encoding the multienzyme. A preferred embodiment is a 
culture which produces a 14-membered polyketide having a 
desired acetate starter unit characterised by the 
substantial absence of polyketides with different starter 
30 units. Thus, for example, C13-methyl-erythromycin can be 
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produced substantially free from natural analogues 
resulting from the incorporation of propionate starter 
units . 

It is particularly useful to provide a loading module 
5 of the type KSq - ATq-ACP for a PKS gene assembly which 

produces a 14-membered macrolide in order to prepare a 14- 
membered macrolide which contains exclusively or almost 
exclusively an acetate starter unit, even when such PKS 
gene assembly is expressed at high levels in an 

10 actinomycete host cell. Particularly suitable PKSs for 
this purpose are the components of PKSs for the 
biosynthesis of erythromycin, methymycin, oleandomycin, 
tylosin, spiramycin, midecamycin, and niddamycin for all of 
which the gene and modular organisation is known at least 

15 in part. Particularly suitable sources of the genes 

encoding a loading module of the type KSq - ATq-ACP are the 
loading modules of oleandomycin, spiramycin, niddamycin, 
methymycin and monensin which are specific for the loading 
of malonate units which are then decarboxylated to acetate 

20 starter units. 

In the loading module of the type KSq - ATq-ACP the 
domains or portions of them may be derived from the same or 
from different sources, and comprise either natural or 
engineered domains. For example the ATq domain can be 

25 replaced by an AT domain derived from any extension module 
of a Type I PKS, having specificity for loading of malonate 
units, so long as the KSq domain is chosen to have a 
matching specificity towards malonate units. 

Alternatively, the KSq domain in the loading module 
provided of the type KSq - ATq-ACP may be substituted by 



30 
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the CLP polypeptide of a Type II PKS . It is now apparent 
that in contrast to its previous identification as a factor 
uniquely determining chain length, the CLP, in addition to 
any other activities that it may possess, is the analogue 
of the KSq domain and can act as a decarboxylase towards 
bound malonate units . 

The appreciation that the CLF domain of Type II PKS's 
has decarboxylating activity has led us to devise useful 
interventions in Type II systems, e.g. to enhance the 
yields obtainable in some fermentations. Many high- 
yielding industrial fermentations tend to give mixtures, 
owing to the incorporation of undesired starters. This is 
particularly the case in systems which have auxiliary genes 
for generating unusual starters. CLF genes may act to 
produce undesired acyl species, leading to products 
incorporating the undesired acyl units. 

For example the production of oxytetracycline involves 
an unusual malonamido starter. However the undesired 
activity of a CLF domain causes some decarboxylation, 
leading to the incorporation of acetyl instead. Daunomycin 
synthesis likewise involves an unusual starter which is 
liable to the ^^parasitic'' activity of a CLF domain. 

The active site (for decarboxylation) of a CLF domain 
generally includes a glutamine residue, We find that the 
decarboxylating activity of the domain can be removed by a 
mutation by which the Gin residue is converted into (for 
example) Ala. 

Thus in a further aspect the invention provides a 
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system and process for synthesis of a type II (aromatic) 
polyketide, in which a gin residue of a CLF domain of the 
type II PKS is mutated to suppress decarboxylation 
activity. Techniques of site-specific mutagenesis by which 
5 this can be achieved are by now well known to those skilled 
in the art. 

The loading module of the type KSq - ATq-ACP may be 
linked to a hybrid PKS produced for example as in 
PCT/GB97/01819 and PCT/GB97/01810 . It is particularly 

10 useful to link such a loading module to gene assemblies 

that encode hybrid PKSs that produce novel derivatives of - 
14-membered macrolides as described for example in 
PCT/GB97/01819 and PCT/GB97/01810 , 

The invention further provides such PKS assemblies 

15 furnished with a loading module of the type KSq - ATq-ACP, 
vectors containing such assemblies, and transformant 
organisms that can express them. Transf oinnant organisms may 
harbour recombinant plasmids, or the plasmids may 
integrate. A plasmid with an int sequence will integrate 

20 into a specific attachment site (att) of the host's 

chromosome. Transformant organisms may be capable of 
modifying the initial products, eg by carrying out all or 
some of the biosynthetic modifications normal in the 

production of erythromycins (as shown in Figure 5) and for 
25 other polyketides . Use may be made of mutant organisms 

such that some of the normal pathways are blocked, e.g. to 
produce products without one or more "natural" hydroxy- 
groups or sugar groups. The invention further provides 
novel polyketides as producible, directly or indirectly, by 
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transf oxTTiant organisms. This includes polyketides which 
have undergone enzymatic modification. 

In a further aspect the invention provides both 
previously-obtained 14-membered ring macrolides- and novel 
14-membered ring macrolides in a purer form with respect to 
the nature of the acetate starter unit, than was hitherto 
possible. These include 14-membered ring macrolides which 
are either "natural" or may differ from the corresponding 
" natural " compound : 

a) in the oxidation state of one or more of the ketide 
units (ie selection of alternatives from the group: -CO-, - 
CH(OH)-, alkene -CH-, and -CH2- ) where the stereochemistry 
of any -CH{OH)- is also independently selectable; 

b) in the absence of a "natural" methyl side-chain; or 

c) in the stereochemistry of "natural" methyl; and/or ring 
substituents other than methyl. 

It is also possible to prepare derivatives of 14- 
membered ring macrolides having the differences from the 
natural product identified in two or more of items a) to c) 
above . 

Derivatives of any of the afore-mentioned polyketides 
which have undergone further processing by non-PKS enzymes, 
eg one or more of hydroxylation, epoxidation, glycosylation 
and methylation may also be prepared. 

The present invention provides a novel method of 
obtaining both known and novel complex 14-membered 
macrolides having an acetate starter unit substantively 
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free of products differing only in having a propionate 
starter unit. 

Sui table plastnid vectors and genetically engineered 
cells suitable for expression of PKS genes incorporating an 
5 altered loading module are those described in 

PCT/GB97/01819 as being suitable for expression of hybrid 
PKS genes of Type I . Examples of effective hosts are 
Saccharopolyspora. exythraea, Streptomyces coelicolor, 

Streptomyces avermitilis , Streptomyces griseofuscus , 

10 5^treptojnyces cinnamonensis , Streptomyces fradiae, 

Streptomyces longisporoflavus, Streptomyces hygroscopicus , 

Micromonospora griseoruhida, Streptomyces lasallensds , 

Streptomyces venezuelae, Streptomyces antibiotlcus , 

Streptomyces lividans , Streptomyces rimosus, Streptomyces 

15 alJbus, Amycolatopsis mediterranei , and Streptomyces 

tsukuhaensis . These include hosts in which SCP2* -derived 

plasmids are known to replicate autonomously, such as for 
example S. coelicolor, S. avermitilis and S, griseofuscus ^ 

and other hosts such as Saccharopolyspora erythraea in 

20 which SCP2* -derived plasmids become integrated into the 
chromosome through homologous recombination between 
sequences on the plasmid insert and on the chromosome; and 
all such vectors which are integratively transformed by 
suicide plasmid vectors. 

25 

Although some 3.3 -methyl erythromycins (also known as 15- 
norerythromycins) have been reported previously (Kibwage et 
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al., J. Antibiotics, 40, 1-6, 1987; Weber & McAlpine, U.S. 

Patent 5,141,926), these have been confined to 15- 
norerythromycin C, and 6-deoxy-15-norerythroTnycins B and D, 
Moreover, not only have these l5-norerythromycins been 
found as extremely minor components co-expressed with high 
levels of "natural" erythromycins (13-ethyl erythromycins) , 
but the 13 -methyl counterparts (15 -norer^'throraycins A and 
B) to the most desirable and biologically-active "natural" 
erythromycins (erythromycin A and B) have never been 
previously isolated. Chemical modification of "natural" 
erythromycins has proven to be an extremely effective means 
for enhancing the bioefficacy of the "natural" molecules. 
Thus, it would be envisaged that chemical modification of 
novel erythromycins would similarly produce compounds with 
desirable and enhanced bioef f icacies . PCT/GB97/01819 
describes in general terms the production of novel 
polyketides through recombinant DNA technologies, and the 
use of these technologies to generate novel erythromycins, 
many of which have different starter units to the 
propionate starter unit characteristic of the "natural" 
er^'-thromycins, are described in pending International 
Patent Application PCT/GB97 /0181C , Some chemical 
modification of these novel er^'thromycins are also 
described in co-pending International Patent Applications 
PCT/IB98/02100 and PCT/IB98/02099 . However, it is clear that the 
ability to produce novel erythromycins at good expression 
levels and in the substantial absence of novel or natural 
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erythromycins with different starter units is essential to 
facilitate the ability to achieve a wide range of chemical 
modifications to such novel erythromycins. The enhanced 
ability to produce polyketides at good expression levels 
5 and in the substantial absence of polyketides with 
different starter units has been described in this 
application are family members, and we now describe the 
ability to produce 13 -methyl erythx'omycins at good 
expression levels and in the substantial absence of 

10 erythromycins with different starter units. The use of 

this technology has now permitted the preparation of large 
amounts of 13 -methyl er^'thromycins which for the first time 
has permitted us to carry out a wide range of chemical 
modifications which had only been previously possible 

15 starting from the "natural" erythromycins. 

Some embodiments of the invention will now be 
described with reference to the accompanying drawings in 
which: 

20 Fig 1 is a diagram showing the functioning of 6- 

deoxyerythronolide B synthase (DEBS) , a modular PKS 
producing 6-deoxyerythronolide B (6 -DEB) a precursor of 
erythromycin A. 

Fig 2 gives the amino acid sequence comparison of the 

25 KS domains and the CLF domains of representative Type II 

PKS gene clusters. The active site Cysteine (C) of the KS 
domains is arrowed in the Figure and aligns with the 
Glutamine (Q) or glutamic acid (E) of the CLF domains. The 
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abbreviations used, and the relevant Genbank/EMBL accession 
numbers are: GRA: granaticin from Streptomyces 

violaceoruher (X63449) / HIR: unknown polyketide from 

Saccharopolyspora hirsuta (M98258) ; ACT, actinorhodin from 

Streptomyces coelicolor (X63449) ; GIN: unknown polyketide 

from Streptomyces ciimamonensis (Z11511) ; VNZ : jadomycin 

from Streptomyces venezuelae (L33245) ; NOG: anthracyclines 

from Streptomyces nogalater {Z48262) ; TCM: tetracenomycin 

from S. glaucescens (M80674) ; DAU: daunomycin from 

Streptomyces sp. C5 (L34880) ; PEU, doxorubicin from 

Streptomyces peucetius (L35560) ; WHI : WhiE spore pigment 

from Streptomyces coelicolor (X55942) , 

Fig 3 shows the gene organisation of the PKSs for 
three IS-membered ring macrolides, tylosin, spiramycin and 
niddamycin. 

Fig 4 shows the amino acid sequence alignment of KSq- 
ATq loading didomains of the PKSs for niddamycin, 
platenolide (spiramycin) , monensin, oleandomycin and 
tylosin. The sequences for the monensin and oleandomycin 
loading didomains* have not been previously disclosed. 

Fig. 5 The enyzmatic steps that convert 6- 
deoxyerythronolide B into erythromycin A in 

Sa ccharopolyspora erythraea 

Fig. 6 is a diagram showing the construction of 
plasmid pJLKll? . 

Fig. 7 shows the structures of two oligonucleotides. 
The present invention will now be illustrated, but is 
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not intended to be limited, by means of some examples. 



All NMR spectra were measured in CDCI3 using a Bruker 500MHz 
DMX spectrometer unless otherwise indicated and £eak positions 
5 are expressed in parts per million (ppm) downfield from 
tetramethysilane . The atom number shown in the NMR structure 
is not representative of standard nomenclature, but correlates 
NMR data to that particular example. 

10 HPLC methods 

Method A 

Column Waters Symmetry 5_ C18 2.1mm X 150 mm 
Flow 0.2 9 ml/min 

15 Mobile phase Gradient: A:B (22:78) to A:B (38:62) 

over 12 minutes, then to A:B (80:20) 
by minute 15 . Maintain for 1 minute^. 
Re-equilibrate before next sample. 
Where A = acetonitrile and B = 0 . OlM 
20 ammonium acetate in 10% acetonitrile 

and 0.02% TFA 
Instrument Acquired with Hewlett-Packard 1050 

liquid chromatograph interfaced to 
a VG Platform II mass spectrometer 
25 equipped with an APCI source 

Method B 

Column Waters Symmetry 5_ C18 2.1mm X 150 mm 
Flow 0.29 ml/min 

30 Mobile phase Gradient ; 28 : 72 acetonitrile : lOmM NH40Ac to 
50:50 in 18 minutes. 50:50 until 25 minutes. Back to 28:72, 
re-equilibrate for 7 minutes 

Instrument Acquired with Hewlett Packard 1100 LC/MS with 

APCI source 

35 

Tap Water medium 

glucose 5g/liter 
tryptone 5g/liter 
yeast extract 2.5g/liter 
40 EDTA 3 6mg/liter 

Tap water to IL total volume 

ERY - P mf^djiim 

dextrose 50g/liter 
Nutrisoy '^^ flour 30g/liter 
(NH4)2S04 3g/liter 
NaCl 5g/liter 
CaCOa 6g/liter 
50 Tap water to IL total volume 

pH adjusted to 7,0 
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Example 1 

Construction of the Recombinant Vector PPFL43 

5 Plasmid pCJR24 was prepared as described in 

PCT/GB97/01819 . pPFL43 is a pCJR24-based plasmid 
containing the gene encoding a hybrid polyketide synthase 
that contains the putative monensin PKS loading module 
(isolated from S. cinnamonensis) the DEBS extension 

10 modules 1 and 2 and the chain- terminating thioesterase . 
Plasmid pPFL43 was constructed as follows: 

The following synthetic oligonucleotides: 5'- 
CCATATGGCCGCATCCGCGTCAGCGT-3 ' and 5 ' - 
15 GGCTAGCGGGTCCTCGTCCGTGCCGAGGTCA-3 ' 

are used to amplify the DNA encoding the putative 
monensin-producing loading module using a cosmid that 
contains the 5' end of the putative monensin-producing 
PKS genes from S. cinnamonensis or chromosomal DNA of 

20 cinnamonensis as template. The PGR product of 3.3 kbp is 

purified by gel electrophoresis, treated with T4 
polynucleotide kinase and ligated to plasmid pUClS, which 
has been linearised by digestion with Sma 1 and then 

treated with alkaline phosphatase. The ligation mixture 
25 was used to transform electrocompetent E.coli DHIOB cells 

and individual clones were checked for the desired 
plasmid pPFL40. Plasmid pPFL40 was identified by 
restriction pattern and sequence analysis. 
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Plasmid pHD30His is a derivative of pNEWAVETE 
(PCT/GB97/01810) which contains the avermectin loading 
module/ erythromycin extension modules 1 and 2 and the 
ery thioesterase domain. Plasmid pNEWAVETE was" cut with 
5 EcoRI and HinDIII and a synthetic oligonucleotide linker 
was inserted that encodes the addition of a C-terminal 
polyhistidine tail to the polypeptide. The following 
oligonucleotides : 

5 ' -AATTCACATCACCATCACCATCACTAGTAGGAGGTCTGGCCATCTAGA-3 ' 
10 and 

5 ' -AGCTTCTAGATGGCCAGACCTCCTACTAGTGATGGTGATGGTGATGTG-3 ' 
were annealed together and the duplex was ligated to 
EcoRI-and HinDIII-cut pNEWAVETE. The resulting plasmid 
was cut with Ndel and Xbal and ligated into plasmid 
15 pCJR24 that had been previously cut with same two 
enzymes, to produce plasmid pNDBOHis, 

Plasmid pPFL40 was digested with Nde I and Nhe I and 

the 3,3 khp fragment was purified by gel electrophoresis 
and ligated to pND30-His previously digested with Nde I 

20 and Nhe 1 and treated with alkaline phosphatase. The 

ligation mixture was used to transform electrocompetent 
E.coli DHIOB cells arid individual clones were checked for 

the desired plasmid pPFL43 . Plasmid pPFL43 was 
identified by restriction analysis. 

25 

Example 2 

Construction of S. ervthraea NRRL23 3 8/pPFL43 
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Plasmid pPFL43 was used to transform S. erythraea 
NRRL2338 protoplasts, Thiostrepton resistant colonies 
were selected in R2T20 medium containing 10 M-g/ml of 

thiostrepton. Several clones were tested for the 
5 presence of pPFL43 integrated into the chromosome by 
Southern blot hybridisation of their genomic DNA with 
DIG- labelled DNA containing the mon PKS fragment 

encoding for the loading module. A clone with an 
integrated copy of pPFL43 was selected in this way. 
10 Example 3 

Production of 13 -methyl -ervthromycin A and B using Sacch. 

erythraea NRRL 233 8/pPFI:i43 

The culture Saccharopolyspora. erythraea 
NRRL233 8 (pPFL43) , constructed with the wild-type loading 

15 domain displaced by a monensin loader-DlTE DNA insert, 
produced as described in Example 2, was inoculated into 
30ml Tap Water medium with 50 ug/ml thiostrepton in a 3 00ml 
Erlenmeyer flask. After three days incubation at 29°C, this 
flask was used to inoculate 300 ml of ERY-P medium in a 3 00 

20 ml flask. The broth was incubated at 29°C at 2 00 rpm for 6 
days. After this time, the whole broth was adjusted to pH 
8.5 with NaOH, then extracted with equal volume of ethyl 
acetate . The ethyl acetate extract was evaporated to 
dryness at 45*^C under a nitrogen stream using a Zymark 

25 TurboVap LV Evaporator, then reconstituted in 0.0625 

volumes methanol to concentrate the extract 16 -fold. The 
structures of the- products were confirmed by LC/MS, Method 
A, A 4.0 min retention time peak was observed as the major 
component, with m/z value of 720 (M+H)"^, required for 13- 

3 0 methyl -erythromycin A. A second peak was observed with a 

retention time of 6.4 min and with m/z value of 704 (M+H)"", 
required for 13 -methyl -erythromycin B, 

Example 4 

35 Production and Recovery of 13 -methyl -ervthromvc in A and B 
using Sacch, erythraea NRRL-233 8 (tdPFL43) at 8L scale 

Saccharopolyspora erythraea NRRL23 3 8 (pPFL4 3) was 
inoculated into lOOOmls Tap Water medium with 50 ixg/ml 
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thiostrepton in a 2.81 Fernbach flask. After three days 
incubation at 29°C, this flask was used to inoculate 81 of 
ERY-P medium in a 141 Microferm fermentor jar (New 
Brunswick Scientific Co., Inc., Edison, NJ) . The broth was 
5 incubated at 28 °C with an aeration rate of 81/min, stirring 
at 800 rpm and with pH maintained between 6.9 and 7.3 with 
NaOH or H2SO4 (15%) . Water was added to maintain volume at 
the 24 hour volume level. The fermentation was continued 
for 167 hours. After this time, presence of 13 -methyl - 

10 erythromycin A and B were confirmed by adjusting a broth 
sample from the fermentor to pH 8.5 with NaOH, then 
extracting with equal volume of ethyl acetate. The ethyl 
acetate extract was evaporated to dryness at 45^C under a 
nitrogen stream using a Zymark TurboVap LV Evaporator, then 

15 reconstituted in 0.25 volumes methanol to concentrate the 
extract 4 -fold. The structures of the products were 
confirmed by LC/MS, Method A. A 4.1 min retention time 
peak was observed as the major component, with m/z value of 
720 (M+H)"^, required for 13 -methyl -erythromycin A. A second 

20 peak was observed with a retention time of 6.6 min and with 
^ m/z value of 704 (M+H)^, required for 13 -methyl-erythromycin 
B. 

About 35 liters of broth containing approximately 2.8 
grams of 13 -methyl- erythromycin A were processed for recovery 

25 of product. Broth was filtered through a pilot sized Ceraflo 
ceramic unit and loaded onto a 500ml XAD-16 resin column. The 
product was eluted using 100% methanol. A 175ml CG-161 
adsorption column was prepared and equilibrated with 20% 
methanol /water . A portion of the product solution was adjusted 

30 to 2 0% methanol and loaded onto the column, no breakthrough 
of product was observed. Washing of the column with up to 40% 
methanol/water failed at removing any significant level of 
impurities. Elution with 50% methanol/water achieved 
chromatographic separation of the product from the two major 

35 impurities, 13 -methyl -erythromycin B and a degradation 
product, 13-methyl-dehydroerythromycin A. The purest cuts were 
combined and reduced in volume by approximately 75% using 
evaporation to achieve <10% methanol concentration. To enhance 
13 -methyl -erythromycin A extraction, solid sodium bicarbonate 

40 was added until a total concentration of 250mM was obtained. 
The aqueous product layer was extracted 2x with methylene 
chloride, using one-half the total volume each time. The 
volume was reduced to light yellow solids by evaporation. The 
13 -methyl-erythromycin A was purified by dissolving the crude 

45 crystals into methylene chloride at ambient temperature and 
diluting to 15% methylene chloride with hexane . The cloudy 
solution is placed at -10°C for ^30 minutes when the liquid 
is decanted to a 2""^ flask, leaving the majority of impurities 
behind as an oil. The flask is left overnight at -10°C, 

50 followed by filtration of of f-whitel3 -methyl-erythromycin A 
crystals the next day. Approximately 300 milligrams of 13- 
methyl- erythromycin A were isolated from the partial work-up 
of the 351 broth volume. 
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Approximately 100 grams of evaporated mother liquor were 
utilized further to isolate 13 -methyl- erythromycin B . Residual 
13 -methyl -erythromycin A was removed with repetitive 
extraction of the initial sample with aqueous acetic acid (pH 
5) . The subsequent methylene chloride layer was 

chromatographed on 700 g of silica gel using 20% methanol in 
methylene chloride. The 13 -methyl -erythromycin B enriched 
fractions, as determined by LC/MS, were combined and 
evaporated to yield "11. 0 grams of dark oil. The oil was 
dissolved in a minimal amount of methanol and loaded onto 500 
ml of Amberchrom CG-161 resin. The 13 -methyl -erythromycin B 
was eluted at 2 bed volumes per hour with 40% methanol in 
deionized water. One bed volume fractions were collected and 
assayed by LC/MS. Fractions 42 through 62 were combined, 
diluted to 20% methanol with deionized water, and neutralized 
to pH 7.5 with sodium bicarbonate . The resulting solution was 
extracted once with 41 of methylene chloride, concentrated to 
500 ml, and dried over anhydrous magnesium 

sulfate. After removal of the MgS04 by filtration the 
filtrate was evaporated to give ~110 mg of light brown solids 
The 110 mg of crude 13-methyl-erythromycin B was dissolved in 
3.0 milliliters of HPLC grade acetonitrile and loaded onto 
a 2 0cm X 20cm, 2mm thick, silica gel preparative thin layer 
chromatography (PTLC) plate. The plate was developed with 
60:40 methanol: acetonitrile. The desired portion of silica 
from the PTLC plate (iodine visualisation) was removed and 
extracted with HPLC grade acetone. The acetone extract was 
evaporated to give 12.1 mg of clear solid. 

Identification of the 13-methyl-erythromycin A and 13- 
methyl-erythromycin B samples were confirmed by mass 
spectroscopy (LC/MS Method B) and NMR spectroscopy The 13- 
methyl-erythromycin A sample peak had a 4.7 min retention 
time, with m/z value of 720 (M+H) % required for 13-methyl- 
erythromycin A. The 13-methyl-erythromycin B sample peak had 
a 7.6 mm retention time, with m/z value of 704 (M+H)*, 
required for 13-methyl-erythromycin B. 

NMR. 13-methvl-ervthromYcin A : 
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NMR, 13-inethyl-erythromycinB: 
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Construction of plasmid PPFL35 

Plasmid pPFL35 is a pCJR24 -based plasmid containing a 
PKS gene comprising a loading module, the first and second 
extension modules of DEBS and the chain terminating 
thioesterase . The loading module comprises the KSq domain 
DNA from the loading module of the oleandomycin PKS fused 
to the malonyl-CoA- specif ic AT of module 2 of the rapamycin 
PKS, in turn linked to the DEBS loading domain ACP, Plasmid 
pPFL35 was constiructed via several intermediate plasmids as 
follows : 

A 411 bp DNA segment of the eryAI gene from 

S.erythraea extending from nucleotide 1279 to nucleotide 

1690 (Donadio, S. et al . , Science (1991) 2523:675-679) was 

amplified by PGR using the following synthetic 
oligonucleotide primers:- 

5 ' -TGGACCGCCGCCAATTGCCTAGGCGGGCCGAACCCGGCT- 3 ' and 
5 < -CCTGCAGGCCATCGCGACGACCGCGACCGGTTCGCC-3 • 

The DNA from a plasmid designated pKSW, derived from 
pT7-7 and DEBSl-TE in which new Pst I and Hindlll sites had 

been introduced to flank the KSl of the first extension 
module, was used as a template. The 441 bp PGR product was 
treated with T4 polynucleotide kinase and ligated to 
plasmid pUClS, which had been linearised by digestion with 
Sma. I and then treated with alkaline phosphatase . The 

ligation mixture was used to transform electrocompetent 
E.coli DHIOB cells and individual clones were checked for 

the desired plasmid, pPFL26- The new Mfe 1/Avr II sites 
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bordering the insert are adjacent to the Eco RI site in the 

polylinker of pUClB. Plasmid pPFL26 was identified by 
restriction pattern and sequence analysis . 

An Mfe I restriction site is located 112 bp from the 

5 5' end of the DNA encoding the propionyl-CoA: AQP 
transferase of the loading module of DEBS. Plasmid pKSW was 
digested with Mfe I and Pst I and ligated with the 411 bp 

insert obtained by digesting plasmid pPFL2 6 with Mfe I and 

Pst I. The ligation mixture was used to transform 

10 electrocompetent E.coli DHIOB cells and individual clones 

were checked for the desired plasmid, pPFL27, Plasmid 
pPFL2 7 contains a PKS gene comprising the DEBS loading 
module, the first and second extension modules of DEBS and 
the DEBS chain terminating thioesterase . Plasmid pPFL27 was 
15 identified by its restriction pattern. 

Plasmid pPFL27 was digested with Nde I and Avr II and 

ligated to a 4 . 6kbp insert derived from digesting plasmid 
pM06 (PCT/GB97/01819) with Nde I and Avr II. Plasmid pM06 

contains a PKS gene comprising the DEBS loading module, the 
20 first and second extension modules of DEBS and the DEBS 
chain terminating thioesterase, except that the DNA segment 
encoding the methylmalpnate- specif ic AT within the first 
extension module has been specifically substituted by the 
DNA encoding the malonate- specif ic AT of module 2 of the 
25 rap PKS. The ligation mixture was used to transform 

electrocompetent E. coli DHIOB cells and individual clones 

were checked for the desired plasmid, pPFL28. Plasmid 
pPFL28 contains a hybrid PKS gene comprising the DEBS 
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loading module, the malonate -specific AT of module 2 of the 
rap PKS, the ACP of the DEBS loading module, followed by 
the first and second extension modules of DEBS and the DEBS 
chain terminating thioesterase . Plasmid pPFL28 was 
identified by restriction analysis. 

A DNA segment encoding the KSq domain from the oleAI 
gene of S. antihioticus extending from nucleotide 1671 to 

nucleotide 3385 was amplified by PGR using the following 
synthetic oligonucleotide primers : - 
5 • - CCACATATGCATGTCCCCGGCGAGGAA- 3 » and 
5 * -CCCTGTCCGGAGAAGAGGAAGGCGAGGCCG-3 ' 

and chromosomal DNA from Streptomyces antibioticus as a 

template. The PGR product was treated with T4 
polynucleotide kinase and ligated to plasmid pUClB, which 
had been linearised by digestion with Sma I and then 

treated with alkaline phosphatase. The ligation mixture was 
used to transform electrocompetent K. coli DHIOB cells and 

individual clones were checked for the desired plasmid, 
pPFL31. The new NdG I site bordering the insert is adjacent 

to the Eco RI site of the pUClB polylinker while the new 

Bsp EI site borders the Hin dlll site of the linker region. 

Plasmid pPFL31 was identified by restriction and sequence 

analysis . 

Plasmid pPFL31 was digested with Nde I and Avr II and 

the insert was ligated with plasmid pPFL28 that had been 
digested with Nde I and Avr II. The ligation mixture was 

used to transform electrocompetent E.coli DHIOB cells and 
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10 



individual Clones were checked for the desired plasmid, 
PPFL32. Plasmid pPFL32 was identified by restrictiol 

analysis . 

Plasmid PPFL32 was digested with Nds I and Xba I and 
the insert was ligated to plasmid pCJR24, which had been 
digested with Nde I and Xba I and purified by gel 
electrophoresis. The ligation mixture was used to transform 
electrocompetent E.coll DHIOB cells and individual clones 
were checked for the desired plasmid, pPFL35 . Plasmid 
PPPL35 was identified by restriction analysis. 

Example 6 

Construction of .«?. ervthr^^^, NRRL2 ^ ft /p Pi?T.:> 

Plasmid PPFL35 was used to transform S.ezythraea 
NRRL2338 protoplasts. Thiostrepton resistant colonies 
were selected in R2T20 medium (Yamamoto et al . ) 
containing lo ^g/ml of thiostrepton. Several clones were 
tested for the presence of pPFL35 integrated into the 
chromosome by Southern blot hybridisation of their 
genomic DNA with DIG- labelled DNA containing the rap PKS 
fragment encoding for module 2 AT. A clone with an 
integrated copy of pPFL35 was identified in this way. 

Examp l«=^ 7 

25 

pro duction of 13 -m^fhyl -pr^rthrornvnin A .r.^ ^ ^^^^^ 
ervthr^f a NRRI.-233 8 (pPFT.^^ . h 

The culture Saccharopolyspora erythraea 
30 NRRL2338(pPFL35), constructed with the wild- type loading 
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domain displaced by an oleandomycin KSQ-rapamycin AT2-D1TE 
DNA insert, prepared as described in Example 6, was 
inoculated into 30ml Tap Water medium with 50 ug/ml 
thiostrepton in a 300ml Erlenmeyer flask. After two days 
5 incubation at 29°C, this flask was used to inoculate 300 ml 
of ERY-P medium in a 300 ml flask. The broth was incubated 
at 29 C at 200 rpm for 6 days. After this time, the whole 
broth was adjusted to pH 8 . 5 with NaOH, then extracted with 
an equal volume of ethyl acetate. The ethyl acetate extract 

10 was evaporated to dryness at 45°C under a nitrogen stream 
using a Zymark TurboVap LV Evaporator, then reconstituted 
in 0.25 volumes methanol to concentrate the extract 4- fold. 
The structures of the products were confirmed by LC/MS, 
Method A. A peak was observed with a retention time of 4 . 0 

15 min and with an m/z value of 72 0 (M+H) \ required for 13- 

methyl-erythromycin A (CaGHgsNOaa) . A second peak was observed 
with a retention time of 6.4 min and with m/z value of 704 
(M+H)% required for 13 -methyl -erythromycin B (CaeHssNGis) . 

20 Example 8 

Construction of Recombinant Vector pPFI.44 

Plasmid pPFL44 is a pCJR24- based plasmid containing 
the gene encoding a hybrid polyketide synthase that 
contains the spiramycin PKS loading module, the 
25 erythromycin extension modules 1 and 2 and the chain- 
terminating thioesterase . Plasmid pPFL44 was constructed 
as follows: 



The following synthetic oligonucleotides: 
3 0 5 * - CCATATGTCTGGAGAACTCGCGATTTCCCGCAGT - 3 » and 
5 » -GGCTAGCGGGTCGTCGTCGTCCCGGCTG-3 ' 

were used to amplify the DNA encoding the spiramycin- 
producing loading module using chromosomal DNA from the 
spiramycin producer S, ambofa^ciens prepared according to 
35 the method described by Hopwood et al . (1985) . The PGR 

product of 3.3 kbp was purified by gel electrophoresis, 
treated with T4 polynucleotide kinase and ligated to 
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plasmid pUC18, which had been linearised by digestion 
with Sma T and then treated with alkaline phosphatase . 
The ligation mixture was used to transform 
electrocompetent E.coli DHIOB cells and individual clones 

were checked for the desired plasmid pPFL41. Plasmid 
pPFL41 was identified by restriction pattern and sequence 
analysis . 

Plasmid pPFL41 was digested with Nde I and Nhe I and the 
3.3 kbp fragment was purified by gel electrophoresis and 
ligated to pNDSO ( a plasmid derived from plasmid pCJR?-^ 
having as insert the ave PKS loading module and extension 
modules 1 and 2 or DEBS and the DEBS thioesterase) 
(PCTGB97/01810) previously digested with Nde I and Nhe I* 
and treated with alkaline phosphatase. The ligation 
mixture was used to transform electrocompetent E.coli 
DHIOB cells and individual clones checked for the desired 
plasmid pPFL44 . Plasmid pPFL44 was identified by 
restriction analysis. 

Example 9 

Construction of Sacch, erythraaa NRR Ii23 3 8/T:>PFL44 

Plasmid pPFL44 was used to transform S . erythraea 
NRRL2338 protoplasts. Thiostrepton resistant colonies 
were selected in R2T20 medium containing 10 |ag/ml of 
thiostrepton. Several clones were tested for the presence 
of pPFL44 integrated into the chromosome by Southern blot 
hybridisation of their genomic DNA with DIG- labelled DNA 
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containing the spiramycin PKS fragment encoding for the 
loading module. A clone with an integrated copy of 
pPPL44 was identified in this way. 



5 Example 10 

Production of 13 "Piethvl- erythromycin A and B using Sacch. 
ervthraea NRRL-2338 (pPFL44) 

The culture Saccharopolyspora erythraea NRRL233 8 

10 (pPFL44), constructed with the wild- type loading domain 
displaced by spiramycin loader-DlTE DNA insert, was 
inoculated into 30ml Tap Water medium with 50 ug/ml 
thiostrepton in a 300ml Erlenmeyer flask. After three days 
incubation at 29°C, this flask was used to inoculate 300 ml 

15 of ERY-P medium in a 300 ml flask. The broth was incubated 
at 29°C at 200 rpm for 6 days. After this time, the whole 
broth was adjusted to pH 8.5 with NaOH, then extracted with 
equal volume of ethyl acetate. The ethyl acetate extract 
was evaporated to dryness at 45^C under a nitrogen stream 

20 using a Zymark TurboVap LV Evaporator, then reconstituted 
in 0.0625 volumes methanol to concentrate the extract 16- 
fold. The structures of the products were confirmed by 
LC/MS, Method A. A 4.0 min retention time peak was observed 
as the major component, with m/z value of 720 (M+H)*, 

25 required for 13 -methyl -erythromycin A (CagHgsNOia) . A second 
peak was observed with a retention time of 6.4 min and with 
m/z value of 704 (M+H)"^, required for 13 -methyl -erythromycin 
B (C36H65NO12) . 



30 Example 21 



Construction of plasmid pJLK114 



Plasmid pJLK114 is a pCJR24 based plasmid containing a 
35 PKS gene comprising the ery loading module, the first and 
the second extension modules of the ery PKS and the ery 
chain- terminating thioesterase except that the DNA 
segment between the end of the acyltransf erase and the 
beginning of the ACP of the second ery extension module 
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has been substituted by a synthetic oligonucleotide 
linker containing the recognition sites of the following 
restriction enzymes: Avrll, Bglll, SnaBI, PstI, Spel, 
Nsil, Bsu36I and Hpal . It was constructed via several 
intermediate plasmids as follows (Figure 6). 

Construction of plasmid pJLK02 

The approximately 1.47 kbp DNA fragment of the eryAI gene 
of S. erythraea was amplified by PCR using as primers the 
synthetic oligonucleotides: 

5 » - TACCTAGGCCGGGCCGGACTGGTCGACCTGCCGGGTT- 3 » and 
5 ' -ATGTTAACCGGTCGCGCAGGCTCTCCGTCT-3 » and plasmid pNTEP2 
(Oliynyk, at al . , Chemistry and Biology (1996) 3:833- 

839; WO98/01546) as template. The PCR product was treated 
with T4 polynucleotide kinase and then ligated with 
plasmid pUClB, which had been linearised by digestion 
with Smal and then treated with alkaline phosphatase. The 
ligation mixture was used to transform electrocompetent 
E. coli DHIOB cells and individual colonies were checked 
for their plasmid content. The desired plasmid pJLK02 was 
identified by its- restriction pattern and DNA sequencing. 

Construction of plasmid pJLK03 

The approximately 1,12 kbp DNA fragment of the eryAI gene 
of S. erythraea was amplified by PCR using as primers the 
synthetic oligonucleotides: 

5 » -ATGTTAACGGGTCTGCCGCGTGCCGAGCGGAC-3 ' and 
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5'-CTTCTAGACTATGAATTCCCTCCGCCCAGC-3' and plasmid pNTEPH 
as template. The PGR product was treated with T4 
polynucleotide kinase and then ligated with plasmid 
pUC18, which had been linearised by digestion with Smal 
and then treated with alkaline phosphatase. The ligation 
mixture was used to transform electrocompetent E. coli 
DHIOB cells and individual colonies were checked for 
their plasmid content. The desired plasmid pJLK03 was 
identified by its restriction pattern and DNA sequencing. 

Construction of plasmid pJLK04 

Plasmid pJLK02 was digested with PstI and Hpal and the 
1,47 kbp insert was ligated with plasmid pJLK03 which had 
been digested with PstI and Hpal . The ligation mixture 
was used to transform electrocompetent E. coli DHIOB 
cells and individual colonies were checked for their 
plasmid content. The desired plasmid pJLK04 was 
identified by its restriction pattern. 

Construction of plasmid pJLKOS 

Plasmid pJLKOl (PCT/GB97/01819 ) was digested with PstI 
and Avrll and the 460 bp insert was ligated with plasmid 

pJLK04 which had been digested with PstI and Avrll. The 
ligation mixture was used to transform electrocompetent 
E. coli DHIOB cells and individual colonies were checked 
for their plasmid content. The desired plasmid pJLKOS was 
identified by its restriction pattern. 
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Construction of plasmid pJLK07 

Plasmid pJLKOS was digested with Seal and Xbal and 
plasmid pNTEPH was digested with Ndel and Seal and these 
5 two fragments were ligated with plasmid pCJR24 which had 
been digested with Ndel and Xbal . The ligation mixture 
was used to transform electrocompetent E. coli DHIOB 
cells and individual colonies were checked for their 
plasmid content. The desired plasmid pJLK07 was 
10 identified by its restriction pattern. 

Construction of plasmid pJLK114 

The two synthetic oligonucleotides Plf and Plb (Figure 7) 
15 were each dissolved in TE-buffer. 10 iil of each solution 
(0 .5nmol//Lil) were mixed and heated for 2 minutes to 65C 
and then slowly cooled down to room temperature, Plasmid 
pJLK07 was digested with Avrll and Hpal and ligated with 
the annealed oligonucleotides. The ligation mixture was 
20 used to transform electrocompetent E. coli DHIOB cells 
and individual colonies were checked for their plasmid 
content. The desired plasmid pJLKll4 was identified by 
its restriction pattern, 

25 Plasmid pJLKll? is a pCJR24 based plasmid containing a 

PKS gene comprising the ery loading module, the first and 
the second extension modules of the ery PKS and the- ery 
chain- terminating thioesterase except that the DNA 
segment between the end of the acyltransf erase and the 
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beginning of the ACP of the second ery extension module 
has been substituted by a synthetic oligonucleotide 
linker containing the recognition sites of the following 
restriction enzymes. Avrll, Bglll, SnaBI, PstI, Spel, 
5 Nsil, Bsu36I and Nhel. 

It was constructed via several intermediate plasmids as 
follows (Figure 6) . 

10 Construction of plasmid pJLKllS 

Plasmid PJLK114 was digested with Ndel and Xbal and the 
approximately 9 . 9 kbp insert was ligated with plasmid 
PUC18 which had been digested with Ndel and Xbal. The 
15 ligation mixture was used to transform electrocompetent 
E. coli DHIOB cells and individual colonies were checked 
for their plasmid content. The desired plasmid pJLKllS 
was identified by its restriction pattern. 

20 Construction of plasmid pJLKllS 

Plasmid PJLK13 (PCT/GB97/01819) was digested with Bsu36I 
and Xbal and the l.l kbp fragment was ligated with 
plasmid pJLKllS which had been digested with Bsu36I and 
Xbal . The ligation mixture was used to transform 
electrocompetent E. coli DHlOB cells and individual 
colonies were checked for their plasmid content. The 
desired plasmid pJLKlie was identified by its restriction 
pattern . 
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Construction of plasmid pJLKll? 

Plasmid PJLK116 was digested with Ndel and Xbal and the 
9.9 kbp fragment was ligated with plasmid pCJR24 which 
had been digested with Ndel and Xbal. The ligation 
mixture was used to transform electrocompetent E. coli 
DHlOB cells and individual colonies were checked for 
their plasmid content. The desired plasmid pJLKll7 was 
identified by its restriction pattern. 

Example 11 

Construction of plasmid pJLK29 

• Plasmid PJLK29 is a pJLKllV based plasmid except that the 
DNA fragment encoding the reductive loop of module 10 of 
the rap PKS has been inserted into the mcs . it was 
constructed via several intermediate plasmids as follows. 
(Figure 5) 

Construction of plasmid pJLK121.l 

The approximately 2 . 2 kbp DNA segment of the rapB gene of 

S. hygroscopicus encoding the reductive loop of module 10 

was amplified by PGR using as primers the synthetic 
oligonucleotides : 

5 ■ - TAAGATCTTCCGACGTACGCGTTCCAGC - 3 ■ and 

5'-ATGCTAGCCACTGCGCCGACGAATCACCGGTGG-3' and as template 
an approximately 7 kbp fragment, which has been obtained 
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by digestion of cosmid cos 26 (Schwecke, T. et al. (1995) 
Proc. Natl. Acad. Sci . USA 92:7839-7843) with Seal and 
Sphl. The PGR product was treated with T4 polynucleotide 
kinase and then ligated with plasmid pUCl8, which had 
been linearised by digestion with Smal and then treated 
with alkaline phosphatase. The ligation mixture was used 
to transform electrocompetent E. coli DHlOB cells and 
individual colonies were checked for their plasmid 
content. The desired plasmid pJLKl2l.l was identified by 
its restriction pattern and DNA sequencing. 

Construction of plasmid pJLK29 

Plasmid PJLK121.1 was digested with Bglll and Nhel and 
the 2.2 kbp fragment was ligated with plasmid pJLK117 
which had been digested with Bglli and Nhel . The ligation 
mixture was used to transform electrocompetent E. coli 
DHlOB cells and individual colonies were checked for 
their plasmid content. The desired plasmid pJLK29 was 
identified by its restriction pattern. 

Example 24 

Construction of Plasmid pJLK50 

The approximately 6 . 1 kbp DNA segment of the erythromycin 
PKS gene cluster of S. erythraea encoding the DNA 
fragment from the beginning of the ACP of module 2 to the 
beginning of the ACP of module 3 was amplified by PGR 
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using as primers the synthetic oligonucleotides: 
5 ' -TACCTGAGGGACCGGCTAGCGGGTCTGCCGCGTG-3 • and 
5 ' -ATGCTAGCCGTTGTGCCGGCTCGCCGGTCGGTCC-3 ' and plasmid 
pBAM25 (published pBK25 by Best, D J et al . Eur J Biochem 
(1992) 204: 39-49) as template. The PGR product was 
treated with T4 polynucleotide kinase and then ligated 
with plasmid pUClS, which had been linearised by 
digestion with Smal and then treated with alkaline 
phosphatase. The ligation mixture was used to transform 
electrocompetent E. coli DHIOB cells and individual 
colonies were checked for their plasmid content . The 
desired plasmid pJLKSO was identified by its restriction 
pattern and DNA sequencing. 

Example 25 

Construction of S.erythraea strain JLKIO 

Strain JLKIO is a variant of strain NRRL233 8 in which the 
reductive loop of ery module 2 (i.e. the KR domain) is 

replaced by the reductive loop of the rapamycin module 
10. It was constructed using plasmid pJIjK54 which was 
constructed as follows. 

Construction of plasmid pJLK54 

Plasmid pJLK54 is a pJLK2 9 based plasmid containing a PKS 
gene comprising the ery loading module, the first, the 
second and the third extension modules of the ery cluster 
and the ery chain- terminating thioesterase except that 
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the DNA segment between the end of the acyltransf erase 
and the beginning of the ACP of the second ery extension 
module has been substituted by the equivalent segment of 
module 10 of the rapamycin PKS , 
It was constructed as follows, 

Plasmid pJLKSO was digested with Nhel and the 6.1 kbp 
insert was ligated with plasmid pJLK29 which had been 
digested with Nhel. The ligation mixture was used to 
transform electrocompetent coli DHIOB cells and 
individual colonies were checked for their plasmid 
content. The desired plasmid pJLK54 was identified by its 
restriction pattern. 

Use of plasmid pJLK54 for construction of S . erythraea 
NRRL2338/pJLK54 and the production of TKL derivatives 

Approximately 5 fig plasmid pJLK54 were used to transform 
protoplasts of S. erythraea NRRL2338 and stable 
thiostrepton resistant colonies were isolated. From 
several colonies total DNA is obtained and analysed by 
Southern blot hybridisation, to confirm that the plasmid 
has integrated into the TE. 

Construction of S. erythraea strain JLKIO and its use in 

production of 13 -methyl-10, 11-dehydro-erythromycin A 

S , erythraea strain JLKIO is a mutant of S . erythraea 
NRRL233 8 in which the » reductive loop' of ery module 2 
i.e. the ketoreductase domain is substituted by the 
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•reductive loop' of rapamycin module 10. It was 
constructed starting from S. erythraea NRRL2338 into 
which plasmid pJLK54 had been integrated. S. erythraea 
NRRL2338/pJLK54 was subjected to several rounds of non- 
5 selective growth which resulted in second crossover 
concomitant with the loss of the integrated plasmid. 
Clones in which replacement of the erythromycin gene 
coding for DEBSl with the mutant version had occurred, 
were identified by Southern blot hybridisation. One of 
10 these was named S. erythraea strain JLKIO and was used to 
inoculate SM3 medium (eryP medium gave similar results) , 
and allowed to grow for seven to ten days at 28-3 0''C. 
After this time the broth was centrifuged and the pH of 
the supernatant adjusted to pH 9 . The supernatant was 
15 then extracted three times with an equal volume of ethyl 
acetate and the solvent was removed by evaporation. 
Products were analysed by HPLC/MS, MS /MS and IH-NMR. The 
following macrolide C-13 methyl erythromycin A was 
identified (accompanied by products of incomplete 




Plasmid pPFLSO is a pPFL43 -based plasmid from which a DNA 
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fragment encoding KRl (in part) , ACPI and module 2 of 
the erythromycin PKS and the erythromycin TE, has been 
removed. It was constructed as follows, Plasmid pPFL43 
was digested with Sful and Xbal to remove a 6.5 kb 
5 fragment . The 5 • overhangs were filled in with Klenow 
fragment DNA Polymerase I and the plasmid was 
recircularised. The ligation mixture was used to 
transform electrocompetent E. coli DHIOB cells and 
individual colonies were checked for their plasmid 
10 content. The desired plasmid pPFLBO was identified by its 
restriction pattern. 

Construction of S. erythraea JLKlO/pPFLSO 

Approximately 5 |i.g plasmid pPFLSO were used to transform 

15 protoplasts of S. erythraea strain JLKIO and stable 
thiostrepton resistant colonies were isolated. From 
several colonies total DNA was obtained and analysed by 
Southern blot hybridisation, to confirm that the plasmid 
had integrated into the homologous chromosomal DNA 

20 region. S. erythraea strain JLK10/pPFL50 was used to 
inoculate SM3 medium containing 5 |j.g/ml thiostrepton 
(eryP medium containing 5 |ag/ml thiostrepton gave similar 
results) and allowed to grow for seven to ten days at 28- 
30oC. After this time the broth was centrifuged and the 

25 pH of the supernatant adjusted to pH 9 . The supernatant 
was then extracted three times with an equal volume of 
ethyl acetate and the solvent was removed by evaporation. 
Products were analysed by HPLC/MS, MS /MS and IH-NMR. The 
macrolide C-13 methyl 10 , 11-dehydro-erythromycin A was 



wo 00/00500 



PCT/GB99/02042 



51 



10 



identified (accompanied by products of incomplete 
processing by post-PKS enzymes) 

Construction of S. erythraea NRRL2338/pPFL50 

Approximately 5 plasmid pPFL50 were used to transform 
protoplasts of S. erythraea NRRL2338 and stable 
thiostrepton resistant colonies were isolated. From 
several colonies total DNA was obtained and analysed by 
Southern blot hybridisation, to confirm that the plasmid 
had integrated into the homologous region of the 
chromosomal DNA. S. erythraea NRRL233 8/pPFL50 was used to 
inoculate SM3 medium containing 5 |xg/ml thiostrepton 
(eryP medium containing 5 |ag/ml thiostrepton gives 
similar results) and allowed to grow for seven to ten 
15 days at 28-30oC. After this time the broth was 

centrifuged and the pH of the supernatant adjusted to pH 
9.5. The supernatant was then extracted three times with 
an equal volume of ethyl acetate and the solvent was 
removed by evaporation. Products were analysed by 
20 HPLC/MS, MS /MS and IH-NMR. The macrolide C-13 methyl 

erythromycin A was identified (accompanied by products of 
incomplete processing by post-PKS enzymes) . 



25 



Construction of plasmid pCB121 

Plasmid pCB121 is a plasmid containing the monensin 
loading module and KS of monensin module l followed by 
the erythromycin module 1 AT and part of the erythromycin 
modulel KR . It was constructed via several intermediate 
plasmids as follows . 
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Construction of plasmid pPPL45 

The approximately 1.8 kbp DNA segment of the monensin PKS 
gene cluster of Streptomyces cinnamonensis encoding part 
of the ACP of the loading module and KS of module 1 was 
5 amplified by PGR using as primers the synthetic 
oligonucleotides : 

5 ' -CGTTCCTGAGGTCGCTGGCCCAGGCGTA-3 ' 
5 • -CGAAGCTTGACACCGCGGCGCGGCGCGG-5 ' 

and a cosmid containing the 5' end of the monensin PKS 
10 genes from S. cinnamonensis or alternatively chromosomal 
DNA of S. cinnamonensis as template. The PGR product was 
treated with T4 polynucleotide kinase and then ligated 
with plasmid pUCl8, which had been linearised by 
digestion with Smal and then treated with alkaline 
15 phosphatase. The ligation mixture was used to transform 
electrocompetent E. coli DHIOB cells and individual 
colonies were checked for their plasmid content. The 
desired plasmid pPFL45 was identified by its restriction 
pattern . 

Construction of plasmid pPFL47 

Plasmid pPFL45 was digested with Ndel and Bsu36I and the approximately 2.6 kbp 
fragment was Ugatcd into plasmid pPFM3 which had been digested witii Ndel and 
Bsu36I. The ligation mixture was used to transform electrocompetent E. coli DHIOB 
cells and individual colonics were checked for their plasmid content The desired 
plasmid pPFL47 was identified by its restriction pattern. 

Construction of plasmid pCB13 5 

Plasmid pCJR24 was digested with Hindlll, the 5* overhang 
was filled in with Klenow fragment DNA Polymerase I and 
25 religated. The ligation mixture was used to transform 
electrocompetent E. coli DHIOB cells and individual 
colonies were checked for their plasmid content. The 
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desired plasmid pCB135 was identified by its restriction 
pattern, lacking the recognition site for Hindlll. 
Construction o£ plasmid pKSWl 

Plasmid pKSlW is a pNTEP2 (GB97/01810) -derived vector 
containing a DEBS ITE- derived triketide synthase with the 
unique restriction sites introduced at the limits of KSl. 
5 Plasmid pKSlW is obtained via several intermediate 
plasmids as follows . 

Construction of plasmids pMO09, pMOlO and pM013 

For the PGR amplification for plasmid pM009, the 
10 following synthetic oligonucleotides were used as 

mutagenic primers, one containing a Muni site and the 
other a PstI site: 

5 ' -GCGCGCCAATTGCGTGCACATCTCGAT- 3 ' 

and 5' -CCTGCAGGCCATCGCGACGACCGCGACCGGTTCGCCG- 3* 

15 

For the PCR amplification for plasmid pMOlO, the 
following synthetic oligonucleotides were used as 
mutagenic primers, one containing a Hindlll site and the 
other an EcoRV site: 
20 5' -GTCTCAAGCTTCGGCATCAGCGGCACCAA- 3' 

and 5' -CGTGCGATATCCCTGCTCGGCGAGCGCA-3 • 

For the PCR amplification for plasmid pM013, the 
following synthetic oligonucleotides were used as 
25 mutagenic primers, one containing a PstI site and the 
other a Hindlll site: 

5' -GATGGCCTGCAGGCTGCCCGGCGGTGTGAGCA- 3' 

and 5\ -GCCGAAGCTTGAGACCCCCGCCCGGCGCGGTCGC- 3» 
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PGR was carried out on pNTEP2 (GB97/01810) as template 
using Pwo DNA polymerase and one cycle of: 96**C (Imin) ; 
annealing at 50°C (3min) / and extension at 72^*0 (Imin) , 
and 25 cycles of: 96^*0 (Imin); annealing at 50°C (litiin) ; 
5 and extension at 72**C (Imin) in the presence of 10% 
(vol/vol) dime thy Isulphoxide. The products were end- 
repaired and cloned into pUClS digested with Smal and the 
ligation mixture was transformed into E. coli DH lOB. 
Plasmid DNA was prepared from individual colonies. The 
10 desired plasmids for pMO09 (3.8kbp), pMOlO (3.9 kbp) and 
pM013 (4.3 kbp) were identified by their restriction 
pattern and DNA sequencing. 

Construction of plasmid pMOll 

15 Plasmid pM013 was digested with Hindlll, and the 1,2 kbp 
insert was cloned into pMOlO which had been digested with 
Hindlll, The ligation mixture was transformed into E. 
coli DH lOB. The desired plasmid (5.0 kbp) was 
identified by its restriction pattern and designated 

20 pMOll. 

Construction of plasmid pM012 

Plasmid pMO09 was digested with PstI, and the 1.6 kbp 
insert was cloned into pMOll which had been digested with 
25 Pstl. The ligation mixture was transformed into E. coli 
DH lOB. The desired plasmid (6.6 kbp) was identified by 
its restriction pattern and designated pM012 . 

Constzruction of pKSlW 
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Plasmid pM012 was digested with Muni and EcoRV, and the 
3.9 kbp fragment was cloned into pNTEPH (see below) which 
had been digested with Muni and EcoRV. The ligation 
mixture was transformed into E. coli DH lOB, The desired 
5 plasmid (13. kbp) was identified by its restriction 
pattern and designated pKSlW. 

Construction of pNTEPH 

Plasmid pNTEPH was obtained from pNTEP2 by removing the 
10 Hindlll site. pNTEP2 was digested with Hindlll, the 5' 
overhang was filled in with Klenow Fragment DNA 
Polymerase I and religated. The desired plasmid (13,6 
kbp) was identified by its restriction pattern. 

15 Construction of plasmid pCB13 6 

Plasmid pKSWl was digested with Ndel and Xbal and the 
approximately 11.2 kbp fragment was ligated with plasmid 
pCB135 which had been digested with Ndel and Xbal. The 
ligation mixture was used to transform electrocompetent 
20 E. coli DHIOB cells and individual colonies were checked 
for their plasmid content. The desired plasmid pCB136 was 
identified by its' restriction pattern. 

Construction of plasmid pCB137 
25 Plasmid pCB13 6 was digested with Sful and Xbal to remove 
a 6.5 kb fragment,, the 5' overhangs were filled in with 
Klenow Fragment DNA Polymerase I and religated. The 
ligation mixture was used to transform electrocompetent 
E. coli DHIOB cells and individual colonies were checked 
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for their plasmid content. The desired plasmid pCB137 was 
identified by its restriction pattern* 

Construction of plasmid pCB121 

5 Plasmid pPFL47 was digested with Ndel and Hindlll and the 
approximately 4.4 kbp insert was ligated with plasmid 
pCB137 which had been digested with Ndel and Hindlll. The 
ligation mixture was used to transform electrocompetent 
E. coli DHIOB cells and individual colonies were checked 
10 for their plasmid content. The desired plasmid pCB121 was 
identified by its restriction pattern. 

Example 

Construction of S.erythraea JLK10/pCB121 

15 Approximately 5 \ig plasmid pCB121 were used to transform 
protoplasts of S . erythraea JLKIO and stable thiostrepton 
resistant colonies were isolated. From several colonies 
total DNA was obtained and analysed by Southern blot 
hybridisation, to confirm that the plasmid had integrated 

20 into the homologous chromosomal DNA region. S. erythraea 
strain JLK10/pCB121 was used to inoculate SM3 medium 
containing 5 [ig/val thiostrepton (eryP medium containing 5 
l-ig/ml thiostrepton gave similar results) and allowed to 
grov; for seven to ten days at 28-30oC, After this time 

25 the broth was centrifuged and the pH of the supernatant 
adjusted to pH 9 . The supernatant was then extracted 
three times with an equal volume of ethyl acetate and the 
solvent was removed by evaporation. Products were 
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10 



analysed by HPLC/MS, MS /MS and IH-NMR. The macrolide C13. 
methyl-iO,ll-dehydro-erythromycin A was identified 
(accompanied by products of incomplete processing by 
post-PKS enzymes) : 



Example 

Construction of S. erythraea NRRL2338/pCB121 

Approximately 5 ng plasmid pCBi2i were used to transform 
protoplasts of S. erythraea NRRL2338 and stable 
thiostrepton resistant colonies were isolated. From 
several colonies total DNA was obtained and analysed by 
Southern blot hybridisation, to confirm that the plasmid 
had integrated into the homologous chromosomal DNA 
region. S. erythraea NRRL2338/pPFL50 was used to 
15 inoculate SM3 medium containing 5 ng/ml thiostrepton 

(eryP medium containing 5 ^g/ml thiostrepton gave similar 
results) and allowed to grow for seven to ten days at 28- 
30oC. After this time the broth was centrifuged and the 
pH of the supernatant adjusted to pH=9 . . The supernatant 
20 was then extracted three times with an equal volume of 

ethyl acetate and- the solvent was removed by evaporation. 
Products were analysed by HPLC/MS, MS/MS and IH-NMR. The 
macrolide C13 -erythromycin A was identified (accompanied 
by products of incomplete processing by post-PKS 
25 enzymes) : 

Although the present invention is illustrated by the 
examples listed above, they should not be regarded as 
limiting the scope of the invention. The above 
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descriptions illustrate for the first time the 
construction of a Type I PKS gene assembly containing a 
wholly or partly heterologous KSq- containing loading 
module and its use to obtain polyketide products of 
utility as synthetic intermediates or as bioactive 
materials such as antibiotics. It will readily occur to 
the person skilled in the art that a wholly or partly 
heterologous KSq- containing loading module from other PKS 
gene sets could be used to replace the loading module of 
DEBS, or indeed into a quite different PKS gene assembly. 
It will also readily occur to the person skilled in the 
art that that the additional specificity provided by the 
more efficient discrimination made between methylmalonyl- 
CoA and malonyl-CoA by an ATq, followed by specific 
decarboxylation by a KSq, is preferable to the imperfect 
discrimination between propionyl-CoA and acetyl -CoA that 
is a feature of the DEBS loading module and of many other 
PKS loading modules, in that it maximises the production 
of a single product rather than a mixture differing from 
each other in the nature of the starter unit . The 
avoidance of such mixtures increases yields and avoids 
the need for tedious and difficult separation procedures . 
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Claims 

1. A 14-inember macrolide which incorporates an 
acetate starter unit so that it has a 13-methyl 
substituent, with the proviso that it is not 

5 norerythromycin C, 6-deoxy-15-norerythromycin B or 6- 

deoxy-15-norerythromycin D , 

2. 15-norerythromycin A. 
10 3. 15-norerythromycin B. 

4. A compound of the formula 1: 




HaC ORi2 

15 

or a pharmaceut ically acceptable salt thereof, wherein: 

Ri is H or OH; R2-R4 are each independently H, CH3, or 
CH2CH3; Rs is H or OH; and Re is H, CH3, or CH2CH3; R7 is H 
or desosamine; Re is H, CH3, or CH2CH3; R9 is OH, mycarose 
20 (R12 is H) , or cladinose (R12 is CH3) , Rio is H; or R9 = Rio 
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= O; and Rn is H, CH3, or CH2CH3, with the proviso that 
when R2-R4 are CHar Re is CH3/ Rq is CH3, and Rn is CH3, 
then Ri and R5 are not H and R12 is not H; or also when 
R2-R4 are CH3, Re is CH3, Rs is CH3, and Rn is CH3, then Ri 
and R5 are not OH and R12 is not H, 



10 



15 



5. A compound according to claim 4 wherein Ri is OH; R2- 
R/j are CH3 ; R5 is OH; Re is CH3, R7 is desosamine; Re is 
CH3; R9 is cladinose (R12 is CH3) ; and Rn is CH3 

6. A compound according to claim 4 wherein Ri is H; R2- 
R4 are CH3 ; R5 is OH; Re is CH3, R7 is desosamine; Re 
is CH3; R9 is cladinose (R12 is CH3) ; and Rn is CH3. 

7. A process for making compounds of the formula 1: 

IJJ(CH3)2 

HC 

R7 = H or 

CHa 





OR7 



RlO = OHor 




OH 



H3C OR12 



wherein : 

Ri is H or OH; R2-R4 are each independently H, CH3, or 
CH2CH3; R5 is H or OH; and Re is H, CH3, or CH2CH3; R7 is H 
or desosamine; Re is H, CH3r or CH2CH3; R9 is OH, mycarose 
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(Ri2 is H) r or cladinose (R12 is CH3) , Rio is H; or R9 = Rio 
= O; and Rn is H, CH3, or CH2CH3 

8. A process for making compound of the formula 1^ as 
set out in claim 7 wherein Ri is OH; R2-R4 are CH3 ; R5 is 

5 OH; Re is CH^, R7 is desosamine; Re is CH3; R9 is cladinose 

(R12 is CH3) ; and Ru is CH3 

9. A process for making compound of the formula _1 as 
set out in claim 7 wherein Ri is H; R2-R4 are CH3 ; R5 is 
OH; Re is CH3, R7 is desosamine; Re is CH3; R9 is cladinose 

10 (R12 is CH3) ; and Ru is CH3 

10. A system for producing a 14-membered macrolide 
incorporating an acetate starter unit, said system 
comprising DNA encoding and arranged to express a PKS 
multienzyme which comprises a loading module and a 

15 plurality of extension modules; wherein in the expressed 

multienzyme, said loading module is adapted to load a 
malonyl residue and then to effect a decarboxylation of 
the loaded residue to provide an acetate starter unit 
which is transferred to an adjacent one of said extension 

20 modules; and wherein the extension modules, or at least 

one thereof, are not naturally associated with a loading 
module that effects decarboxylation. 

11. A system according to claim 10 wherein the macrolide 
25 is a compound of formula 1 as defined in any of claims 

4-9. 
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12. A system according to claim 10 or 11 wherein said 
adjacent extension module to which the acetate starter is 
transferred is not naturally associated with a loading 

5 module that effects decarboxylation. 

13. A system according to claim 10, 11 or 12 wherein the 
decarboxylating functionality of the loading module is 
provided by a ketosynthase-type domain having a glutamine 

10 residue in the active site. 

14. A system according to claim 10, 11 or 12 wherein the 
decarboxylating functionality of the loading module is 
provided by a CLF-type domain. 

15 

15. A system according to claim 14 wherein the CLF-type 
domain is substantially as any shown in Fig 2. 

16. A system according to any of claims 10-15 wherein 

20 the loading module's loading functionality is provided by 

an acyltransf erase-type domain having an arginine residue 
in the active site. 

17. A system according to any of claims 10-16 wherein 
25 the loading module includes an acyl carrier protein. 
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18. A system according to any of" claims 10-13, 16 or 17 
wherein at least the KSq domain of said loading module 
corresponds to the loading module of the PKS multienzyme 
of oleandomycin, spiramycin, niddamycin, methymycin, or 

5 monensin. 

19. A PKS multienzyme as expressible by the DNA of the 
system of any of claims 10-18 or a variant having the 
ability to synthesise a compound of formula 1. 

10 

20. Nucleic acid encoding the PKS multienzyme of 
claim 19. 

21. A vector containing nucleic acid as defined in 
15 claim 20. 

22. A transformant organism comprising a system 
according to any of claims 10-18. 

20 23. A process according to claim 7, 8, or 9 which 

comprises culturing an organism according to claim 22 and 

recovering a compound of formula 1. 

24 . A process according to claim 23 wherein said 
25 macrolide is a compound of formula 1 as defined in any of 

claims 4-9- 
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25. A system, organism or process according to any of 
claims 10-24 wherein the plurality of extension modules 
corresponds to the extension modules of a PKS selected 
from erythromycin, narbomycin, pikromycin, lankamycin, 
kujimycin or megalomycin or a mutant or variant thereof 
able to direct synthesis of a macrolide. 
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KCLFDAU KVTGIjGIVAPNGT/>VGAJWD^ 

KJCLFPEXJ MTCrrAAKTASSQUiASPAGRRGI^R\V\rrX3IjGIVAPt« 

KCLFACT MSVLITCVGWAIIISI/UAPYWSAVI^^ 

RCLFHIR MSTWVTG^GVVAm3IJGAIX)HWAATLK^^ 

IQCLPGRA MSTPDRRRAV\m3LSVAAIKXaJC7r^^ 

IQCLpNOG KTAAVVVTGLGVVAPTtXGVREHWSST^ 

KCLPTCM MSAPAFWVTGLGIVAPNGTGTEEYWAATIJ^^ 

KCLPCIN ^fIT-VAVTG^I3IAAPNGU^lPTat^^ 

IQCLFVNZ MSASVVVTGIXSVAAPNSLCaiEDF^^ 

IQCLFVVHIE MSGPQRTCriX3GGSRRAVVTX3LGVl*SPHGT^^ 

KSGRA hrnUWVITGVGVRAPGGSGTKEFWDIJLiTAGOT 

KSHIR MTRRVVITGVGVRAPGGUSAKNF^^ 

j^SACT MKI^VVITGVGVRAPGCacrreQF^^ 

KSCIN ---KTORKVAITGIEVIAPGGlJtJRK^^ 

j^^yj^ KTARRVVITGIEVIJVPGGTCSKAFWta^EXSR^^ 

j^SjjOG MKESINRRVVITGIGIVAPnATGVKPFWDIXT^ 

j^g-jQ^ hrnUiAEK3^VVTTGIGVRAPGGAGTAAFWDI^TA<^^ 

KSDAU M^^U^VVITG^^3VVAPGAIGIKSFWELIJ[-^^ 

KSPEU M^^lR^V^:TGIGVVAPGAVGTKPFWEIZ^^ 

ICSWHI hm^VAVTGIGWAPGGIGTPQFWRI^HSRTATR^ 



KCLFDAU RFADDGRI/SRr^GEVSDFVP-EDHLPKRLLVQTDPMIXS^^ — 

KCLFPEU RFTGDSM^RLAGEVSDFVP-EDHLP — 

KCLFACT RFDVSRYPATIJsjGQIDDFHA-PraiPGRIXPQ^ 

KCLFHIR RFDPTGryPAELAGQVIX)FIV»i-TEHLPI^^ 

KCLFGRA RFDASRYPSRIJVGQIDDFEA-SEHLPSRLLPQTDVSTRy- 

KCLFNOG RFDAGRYPSKIJ^EVPGFV^-EDHLPSRr^l^^ 

KCLFTCM RFDPHGYFVRVGGEVLAFDA-AAHLPGRIJLPQTO 

KCLTCIN RFDPSGyPAQtJ>J3EnCPGFR^-AEKLPGRLVPQTDKVT^ 

Kd-rVNZ RFDPTGYPARI^EVPGFAA-EE3iLPSRIJ:.PQTDR 

KCLFWHIE , REGCAHLPLRVAGEVHGFDA-AETVEDRFLVQTDR 

KSGM. * FFDASPFT^RIAGEI "DFI3AVABgFSPRE\mRMDRATQF"AVAG^^ 

KSHIR t . |: Qppp^^^^xAABC-DroPEHEGI^PREIRRMDRAAQF"AVVC^ 

KSACT FTT)PSFYRSQVAAJE:A-DFDPVAEX5FGPREIX)RMDR^ 

KSCIN FFDPAPFRSKVAAJEA-DFCGIX1^3LSPQEVRRMDRAAQF-AV\^ 

KSVN2 FFDPTPFTlSRVAAEI-DFDPEAHGr^PQEIRRMDRAAQF-AVVAAR-AVADSGIDI^ 

KSNOG AFDPSPFRSRIAABC-DFDPLAEGLTPQQIRBfflDRATQF-AWSARESICTSG 

KSTCM LFDAAPyRSRIAGEI-DFDPIGEGI>SPRQASTroRATQL-A\^^ 

KSDAU tfdatpfi^riaabc-dfdpvaagi>saeqarrij:^ragqf-al\^ 

KSPEU tfdatpfrsriaaec-dfdpvaaglsaeoarrijdr^gqf-;^ 

KSWHI lfdpsguisqiaaext-dfepsdhglgi^taqrcdrwqf-alvaaseavrd^^ 



Fig 2a 
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KCLFDAU 
KCLFPEU 
KCLFACT 
KCLFHIR 
KCLFGRA 
KCLFNOG 
KCLFTCM 
KCl^IN 
KCLFVNZ 
KCLFWHIE 



-PLEAGVITASASGGFASGQRELQNLWSKG PAHVSAVMSFAWFY-AVimKJIAIR 

-PLEAGVITASASGGFAFGQRELQNLWSKG PAHVSAYMSFAWFY-AVNTGQIAIR 

Tirm^GWTANACGGFDFTHREFRKLWSBG PKSVSVYESFAWFV-AVNTGQISIR 

PEYGTGVITSNATGGFEFTHREFRKLWAQG PEFVSVYESFAWFV-AVNTGQISIR 

DDYDLGWTSTAQGGFDFTHREFHKLWSQG PAYVSVYESFAWFY-AVNTGQISIR 

PEYGVGWTASSAGGFEFGHRELQNLWSLG PQYVSAYQSFAWFY- AVNTCQVSIR 

DEYGLGVLTAAGAGGFEFGQRE24QKLW3TG PERVSAYQSFAWFY-AVNTCQISIR 

DPLIX«3VVTASHAGGFEFGQDELQKLLGQG -CPVLSAYQSFAWFY-AVNSGQISIR 

IXDFEMGVVTASASGGFEPGQGELQKLWSQG SQYVSAYQSFAWFY-AVNSGQISIR 

SPYSVGWTAAGSGGGEFGQRELQNLW3HG SRHVGFYQSIAWFY-AASTOQVSIR 

DPSRIGVALGSAVASATSX.ENEYLVMSDSGREWLVDPAHLSPMMFDYL^ PGVMPAEVAWA 

PPERIGVSLGSAVAAATSLEQEYLVLSrXX^REWQ^TDPAYI^^ 

DPARVGVSLGSAVAAATSLEREYLLLSDSGRWEVCAAWLSRH^ 

PPHRIGVVVGSAVGATOGLIX^EYKVVSDGGRLDLVIMl^^ 

DPYRVGVTVGSAVGAT^CLDEEyKVVSDGGRIJDLVDHAYA\^^ 

DASRTGVVVGSAVGCTTSLEEEYAVVSDSGRNWLVDDGYAV^ 

NPERIGVSIGTAVGCTTGLDREYARVSBGGSRWLVKmA^ 

SAHKVGVCVGTAVGCTOKLESEYVALSAGGANWVVDPHRG^ 

SAHKVGVCVCTIAVGCTQKLESEYVALSAO^^ 

DPWRAGATLGTAVGGTTRLEHDYVLVSERGSRWDVDDRRSET^^ 

* . . * . - • 

-HDLRGPVGVVVAEQAGCa:J3ALAHAR-RKVRGGAE-LIVSGA^^ 

-HDLRGFVGVVVTVBQAGGLnAIJVHAR-RKVRGGAE-LIVSGAVDSSL^ 

-HC^SRGPSSALVTVEQAGGLDALGHAR-FriTRRGTP-LVVSGGVDSAL^ 

-HGLRGPGSVLVAEQAGGLnAVGHGG--AVRNGTP-MVVTGGVDSSFDP-V^^ 

-NTMRGPSAALVGEQAGGLDAIGHAR-RIVRRGPG-WC^^ 

-HGLRGPGGVLVTBQAGGIJ3ALGQAR-RQLRRGLP-M^^ 

-HGMRGHSSVFVTEQAGGLDAAAHAA-RLLRKGTLNTALTC^ 

-HC^lKGPSGVVVSIXJAGGLnALAQAR-RLVRKGTP-LIVCGAVEPRSAPC^ 

-NG^^KGPSG^AA;SDQAGGLDAVAQAR-RQIRKGTR-LIVSGGVDASLCP-V^^ 

-NDFKGPCGVVAADEAGGLDALAHAA-LAVRNGTD-TVVC^ 

-AGAEGPV^l^SIXXrI^UDSVGYAV"QGTREGSAC^^ 

- VGAEGPVAMVSDGCTSGLDSLSHAC - SLIAJEXTTTDVMVAGAADTPITP IWSCFDAI KA 

- VGAEG PVTMVSTGCTSGLDSVGNAV- RAI EEX3SADVMFAGAADTP ITP I WACFDAIRA 

- VGAEX3PSTVVSTGCTSGIDAVGIAV- ELVR£X3SVDVM^^ 
-VGAEG PNTVVSTGCTSGLDSVGYARGELIRBGSADVhnJ^SSD?^ 
RIGABGPVSLVSTGCTSGLDAVGRAA- DLIAEGAAEV^^I^ 
-AGAEGPVTVVSTXXrrSGU^AVGYGT-ELIRTCRAE^^A^^^ 

- AGAEGPVNI VSAGCTSGI DS IGYAC - ELI REGTVDVMI^GGVDAPI^ 
-AGAEGPVNIVSAGCTSGIDSIGYAC-ELIREGTVDAMVAGGVDAPIAPIT^ 

- FX^VRGPVQTVSTGCTSGLDAVGYAY - HAVADGRVDVCLAGAADSP I S PITMACPDAI KA 

RLSGSDDPTAGYLPFDRRAAGHVPGEG-GAILAVEDAERVAERG-GKVYGSIAGT-ASFD 
RI>SGS DNPTAGYLP FI)RRAAGHVPG -GAI LTVEDAER^ 
RISTATDPDRAYLPFDE2WVGYVPGB3-GAILVLEDSAAAEARGRHDAYGEL^ 
RVS RATD PGRAYLP FD^TUUgGYVPGEX^ -GAI LLLE^^ 

LVSTS^ADPERAYLPFtATDASGYVPGEG-GAVLrVEI^ AERj:yVRSPLRRD 

GLSTSDDPRRAYLPFIWVAGGHVTKSEXJMiALLVLESDESARARGVT^^ 
FLS EATD P HDA YL P FDARAAGYVPG EG -GAML V ASIADSA 

- MS D S D E PNRA YL P FD RlXi RGYVPGGGRGVVP P LERAEAAP ARG - AEVY G E - AG P L^ 
RLSTS ElE P ARGYLP FDREAQGHVPG EG -GA I LVMEAARAARERG - AR I YG E I AGYG bTFU 
ELS RATEPDRAYRP FTEIAACGFAPABG -GAVL WEEEAAARERG - ADVRATVAGHAATFT 



Fig 2b 
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KSGRA 
KSHIR 
KSACT 
KSCIN 
KSVNZ 
KSNOG 
KSTCM 
KSDAU 
KSPEU 
KSWHI 



KCLFDAU 

KCLFPEU 

KCLFACT 

KCLFHIR 

KCLFGRA 

KCLFNOG 

KCLFTCM 

KCLFCIN 

KCLFVNZ 

KCLFVJHIE 

KSGRA 

KSHIR 

KSACT 

KSCIN 

KSVNZ 

KSNOG 

KSTCM 

KSDAU 

KSPEU 

KSWHI 



TTPRNDDPAHASRPFIX3TRNGFVLABG- 
TTPRNDDPEHASRPFDNSRNGFVLABG- 
TTARNDDPEHASRPBTXrrRDGFVI.ABG- 
TTPRHDAPATASRPFDSTRNGFVIjGBG- 
TTNRYDDPAHASRPFIXrrRl^FVIJ3EX3- 
TTPRNDTPAEASRPFDRTRNGFVIX3EX3- 
TSANNDDPAHASRPFDRNRDGFVLGEX;' 
TSraNITITETLA-PFSRSRNGFVLGEXj- 

tsdhndtpetasrpfsrsrngfvlgex;" 
tspnnddpahasrpfiiadrn3;fv/mgex; 



AAMFVLEEYEAAQRRG- 
-AALFVLEELEHARARG- 
-AAMFVLEDYDSALARG- 
-AAFFVLEEXHSARRRG- 
-AAVFVI^EELESARARG- 
-AAVFVLEEFEHARRRG- 
-SAVTVLEELSAARRRG" 
-GAIWLEEAEAAVRRG 
-GAIWLEEAEAAVRRG 
-AAVL.VL.EDLEHARARG 



AHIYAEVGGYATRSQ 
-AHVYAEISGCATOLW 
-ARIKAEISGYATOCN 
-AHIVAEIAGYATRSN 
-AHIYAEIAGYAT^SN 
-ALVYAEIAGFATRCN 
-AHAYAEVRGFATRSN 
-ARIYAEIGGYASRGN 
-ARIYAEIGGYASRGN 
-ADVYCEVSGYATFGN 



-PPPGSGRP SAIJU^VETAIJU>AGUDRSDIAVVFAIXyVA-V^ 

-PPPGSGRP SAI^VETAIADAGUXSSDIAWFATCAA-VPELI^^ 

-PAPGsSp AS^RLAIiraAGTGPEWI^ 

:pSi^--45SRAA§^ 

pSSSp pmIraaoaa^^ 

"I?S?S?--.?S^AIR^^ 

"^^^SS --Sr^Sa^agtapgdir^ 
^S^S^-"SS^£^ck:rpeev^ 



SUBSTITUTE SHEET (RULE 26) 



wo 00/00500 



5/12 



PCT/GB99/02042 



KCLFDAU 

KCLFPEU 

KCLFACT 

KCLFHIR 

KCLFGRA 

KCLFWOG 

KCLFTCM 

KCLPCIN 

KCLFVNZ 

KCLFWHIE 

KSGHA 

KSHIR 

KSACT 

KSCIN 

KSVN2 

KSNOG 

KSTCM 

KSDAU 

KSPEU 

KSWHI 



P--HRVPVT\^KTLTGRLYSGAGPIJWATGIJJU^ 

P- -RKVPVm^KTLTGRLYSGAGPIJDVATAIXAI^ PDPDLPLDWTCR 
R — EGVPSrrVPK^rrTGFJaYSGGGPhDV^ 

P— SGVPVTAPKTMK5RLYSGGGPLDLVAAUJ^RI^^ PPTVHTAEPVPEHQLDLVTGD 
P- -RGVFVTAPKALTGRIjCAGGGPADLAAALLAIJ^ PATGRHRAVPDAYALDLVTCR 

p — YGVP\rrAPimaxna^GGMdjyvA^ 

P — GAVFVrrAPKTMTORLYAGGAAlJ^VATTVI^ 
P — GRVFNrrcPRmitSRIJiSGAAPLI^CAI^^ 

T — GAVF\rrAPKTtmnU-YSGAAPUDIAAAFIJ^ PDAAYGLDLWGG 
PHAARVPVTAPKTCriXnUVYCAAPVIJT^^^ — DVCHDLDLVTGR 

EHAYAVFVSSIKSMGGHSLGAIGSIEIAASVIJaEHNVA^PTAl^^ 
EHAYRTPVSSIKSMVGHSIXSAIGSIEVAACAUaEHG^^ 
EHARRTPVSSIKSMVGHSI/3AIGSLEIAACVIJa.EHG\^ 

EHAYRTPVSS IKSMVGHSLGAIGS I EIAASAIJ^MEOTVWPTANLHTPDPECDU3Y\^ 

DHAYRTPVSSIKSMVGHSLGAIGSIEIAASAIJ^MEHNNArPPT^ 

DHAYRVFVSSIKShaCHSLGAIGSI^IAASVLAITHm^ 

QRAYDVPVSSIKSmGHSLGAIGSLELAACAI^EHGV^PP^ 

DHAYRVP ISSVKSmGHSUSAAGSLEVAATAIJ^VE^ 

EHAYRVPISSIKSmGHSLGAVGSLEVAATAI^VEYGVIPPTAN^ 

EHAYATPMS£IKSM\A3HSI/SAIGSIELAAC:VIJ^^ 



KCLFDAU 

KCLFPEU 

KCLFACT 

KCLFHIR 

KCLFORA 

KCLFNCX5 

KCLFTCM 

KCLPCIN 

KCLFVN2 

KCLFWHIE 

KSGRA 

KSHIR 

KSACT 

KSCIN 

KSVNZ 

KSN03 

KSTCM 

KSDAU 

KSPEU 

KSWHI 

mole: -/ks2% 



PRAMADARAALWARGHGGFNSALWRGAA 

PRSLADARAALLVARGYGGFNSALWRGAA 

PRSTAPRTA-LVLARGRW3FNSAAVIJIRFAPTP 

PRHQQLGTA-LVLARGKWGFNSAVWRGVTG 

PREAALSAA-LVLARGRHGFNSAVWTLRGSDHRRPT 

PRRTPLARA-LVLARGRGGFNAAM\AAAGPRAETR 

PRELKVDTA-LVVARGM3GFNSALVVRRHG 

VRPAALOTA-LGGARGHGGFNSALVVRAGQ 

PRTAEVNTA-LVIARGHGGFUSAMWRSAN 

ARPAEPRTA-LVLARGIAfSSNSALVLRRGAVPPEGR- 

AREQKVDTV-LTVGSGPGGFQSAMVLHRPEEAA 

AREQKVDTV-LSVGSGFGGFQSAMVLRRLGGANS 

ARERKLRSV-LTVGSGFGGFQSAMVLRDAETAGAAA- 

ARDQKVDSV-LTVGSGFGGFQSAMVLTSAQ RSTV 

CRBQLTDSV-LTVGSGPGGPQSAMVLARPE RKIA 

ARACPVDTV-LTVGSGPGGFQSAMVLCGPGSRGRSAA 

ARBQKVDTV-LSVGSGFGGFOSAAVLARPKETRS 

AREKKVRHA- LTVGSGFGGFQSAMLLSRPER 

AREKRVRHA- LTVGSGFGGFQSAMLLSRLER 

ARERTLRHV-LSVGSGFXSGFQSAWLSGSEGGUI 
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ORGANISATION OF THE TYLOS IN-PRODUCING POLYKETIDE SYNTHASE 




MODULE 1 MODULE 2 



0 



MODULE 4 MODULE 5 . . , . . ^^^^ 
MODULES LV^vv^^v^.^^^vv^y.y^.yv.ys.yg» » ' ' ^ ' ' ' ^TTTT 



MODULE 6 MODULE 7 



tyJGII 



ORGANISATION OF THE SPIRAMYCIN-PRODUCING POLYKETIDE SYNTHASE 




MODULE 1 MODULE 2 



MODULE 3 

srwGXX 



smm\ 



ORGANISATION OF THE NIDDAMYCIN-PRODUCING POLYKETIDE SYNTHASE 




LD 



MODULE 1 MODULE 2 



IE 



MODULE 3 
rZ222ZZZZZZZ^ 



MODULE 4 MODULE 5 

^ Fi/WAn 



MODULE 6 



nidAlW 



MODULE 7 



nidAW 



m: malonyl transferase 

I Q ^ mm; methylmalonyl transferase 

kJ cm: ethylmaionyl transferase 

C2: unknown C2 unit transferase 
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1 50 

niddamycin MAGHGDATAQ KAQDAEKSED GSDAIAVIGM 

platenolide MS GELAISRSDD RSDAVAWGM 

monensin MAAS ASASPSGPSA GPDPIAWGM 

oleandomycin MHVPGEE NGHSIAIVGI 

tylosin MSSALRRAVQ SNCGYGDLMT SNTAAQNTGD QEDVDGPDST HGGEIAWGM 

51 100 

niddam, . . SCRFPGAPGT AEFWQLLSSG ADAWTAADG RRR GTIDA 

platenol. ACRFPGAPGI AEFWKLLTDG RDAIGRDADG RRR GMIEA 

monensin ACRLPGAPDP DAFWRLLSEG RSAVSTAPPE RRRADSGLHG P,..GGYLDR 
oleandom ACRLPGSATP QEFWRLLADS ADALDEPPAG RFPTGSLSSP PAPRGGFLDS 
tylosin SCRLPGAAGV EEFWELLRSG RGMPTRQDDG TWRAA LED 



niddam. . . 
platenol . 
monensin 
oleandom 
tylosin 



101 

PADFDAAFFG 
PGDFDAAFFG 
IDGFDADFFH 
IDTFDADFFN 
HAGFDAGFFG 



MSPREAAATD 
MSPREAAETD 
ISPREAVAKD 
ISPREAGVLD 
MNARQAAATD 



PQQRLVLELG 
PQQRLMLELG 
PQQRLLLELS 
PQQRLALELG 
PQHRLMLELG 



WEALEDAGIV 
WEALEDAGIV 
WEALEDAGIR 
WEALEDAGIV 
WEALEDAGIV 



150 

PESLRGEAAS 
PGSLRGEAVG 
PPTLARSRTG 
PRHLRGTRTS 
PGDLTGTDTG 



niddam. . . 
platenol . 
monensin 
oleandom 
tylosin 



151 

VFVGAMNDDY 
VFVGAMHDDY 
VFVGAFWDDY 
VFMGAMWDDY 
VFAGVASDDY 



ATLLH . RAGA 
ATLLH . RAGA 
TDVLNLRAPG 
AHLAHARGEA 
A.VLTRRSAV 



PTDTYTATGL 
PVGPHTATGL 
AVTRHTMTGV 
ALTRHSLTGT 
SAGGYTATGL 



QHSMIANRLS 
QRAMLANRLS 
HRSILANRIS 
HRGMIANRLS 
HRALAANRLS 



200 

YFLGLRGPSL 
YVLGTRGPSL 
YAYHLAGPSL 
YALGLQGPSL 
HFLGLRGPSL 



niddam. . . 
platenol . 
monensin 
oleandom 
tylosin 



201 

WDTGQSSSL 
AVDTAQSSSL 
TVDTAQSSSL 
TVDTGQSSSL 
WDSAQSASL 



VAVALAVESL 
VAVALAVESL 
VAVHLACESI 
AAVHMACESL 
VAVQLACESL 



RGGTSGIALA 
RAGTSRVAVA 
RSGDSDIAFA 
ARGESDLALV 
RRGETSLAVA 



GGVNLVLAEE 
GGVNLVLADE 
GGVNLICSPR 
GGVNLVLDPA 
GGVNLILTEE 



250 

GS . AAMERVG 
GT . AAMERtrG 
TTELAAARFG 
GT . TGVERFG 
ST.TVMERMG 



niddam. . . 
platenol . 

monensin 
oleandom 
tylosin 



251 

ALSPDGRCHT 
ALSPDGRCHT 
GLSAAGRCHT 
ALSPDGRCYT 
ALSPDGRCHT 



FDARANGYVR 
FDARANGYVR 
FDARADGFVR 
FDSRANGYAR 
FDARANGYVR 



GEGGAIWLK 
GEGGAAWLK 
GEGGGLWLK 
GEGGVWVLK 
GEGGGAWLK 



PLADALADGD 
PLADALADGD 
PLAAARRDGD 
PTHRALADGD 
PLDAALADGD 



300 

RVYCWRGVA 
PVYCWRGVA 
TVYCVIRGSA 
TVYCEILGSA 
RVYCVIKGGA 



niddam , . . 
platenol . 
monensin 
oleandom 
tylos in 



301 

TGNDGGGPGL 
VGNDGGGPGL 
VNSDGTTDGI 
LNNDGATEGL 
VNNDGGGASL 



TVPDRAGQEA 
TAPDREGQEA 
TLPSGQAQQD 
TVPSAR^OAD 
TTPDREAQEA 



VLRA^^^CDQAG 
VLRAACAQAR 
WRLACRRAR 
VLRQAWERAR 
VLRQAYRRAG 



\^PADVRFVE 
VDPAEVRFVE 
ITPDQVQYVE 
VAPTDVQYVE 
VSTGAVRYVE 



350 

LHGTGTPAGD 
LHGTGTPVGD 
LHGTGTPVGD 
LHGTGTPAGD 
LHGTGTRAGD 
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niddeum. , . 
platenol . 
monensin 
oleandom 
tylosin 



niddam, . . 
platenol . 
monensin 
oleandom 
tylosin 



niddam. . . 
platenol . 
monens in 
oleandom 
tylosin 



niddam. . . 
platenol . 
monensin 
oleandom 
tylosin 



niddam. . . 
platenol . 
monensin 
oleandom 
tylosin 



niddam. . , 
platenol . 
monens in 
oleandom 
tylosin 



niddam, . . 
platenol . 
monensin 
oleandom 

tylosin 



niddam. . . 
platenol . 
monensin 
oleandom 
tylosin 



351 

PVEAEALGAV 
PVEAHALGAV 
PIEAAALGAA 
PVEAEGLGTA 
PVEAAALGAV 

401 

LHERALPASL 
LRERTLPGSL 
IHHRRLAPSL 
IKNRHLPASL 
VRKGEtiVPSL 



YGTGRP-.AN EPLLVGSVKT 
HGSGRP..AD DPLLVGSVKT 
LGQDAA. .RA VPLAVGSAKT 
LGTARP.^AE APLLVGSVKT 
LGAGADSGRS TPLAVGSVKT 



NFETPNPAIP 
NFATPSPAIP 
NFTTPNPAIP 
NFTSPNPRXD 
NFSTPNPDIP 



lerlrlkvqt 
ldqlrlkvqt 
i:j\dlgltvqq 
ldalrlrvht 
lddlrlrvqt 



NIGHLEGAAG 
NIGHLEGAAG 
NVGHLEAAAG 
NIGHLEGAAG 
NVGHLEGAAG 



AHAALQPGTG 
AAAELPIiAPG 
DLADWP- .RP 
AYGPWP . . SP 
ERQEW.NEED 



400 

lAGFVKAALC 
lAGLVKAAIiC 
IVGLLKTALS 
lAGLLKTVLS 
IVGLIKATLC 

450 

GGPLLAGVSA 
"GAPLLAGVSS 
EQPLIAGVSS 
DRPLVAGVSS 
DRPRVAGVSS 



451 500 

FGMGGTNCHV VLEETPGG RQPAE.T 

FGIGGTNCHV VLEHLPSR PTPAV . S 

FGMGGTNGHV WA. . . .AAP DSVAVPEPVG VPERVEVPEP VWSEPVWP 
FGMGGTNCHV VLSELRNAGG DGAGKGPYTG TEDRLGATEA EKRPDPATGN 
FGMGGTNVHL VIAEAPAAAG SSGAGGSGAG SGAGISAVSG W 

501 550 
GQADACLFSA SPMLLLSARS EQALRAQAAR LREHL . . EDS GADPLDIAYS 
VAAS . . . LPD VPPLLLSARS EGALRAQAVR LGETV . . ERV GADPRDVAYS 

TPWP VSAHS ASALRAQAGR LRTHLAAHRP TPDAARVGHA 

GPDPAQDTHR YPALILST^S DAALRAQAER LRHHL.EHSP GQRLRDTAYS 
PVWSGRS RVWREAAGR LAE . . WEAG GVGLADVAVT 



551 

LATTRTRFEH 
LASTRTLFEH 
UATTRAPLAH 
LATRRQVFER 
MAD-RSRFGY 

601 

LALLFTGQGA 
VGVLFTGQGA 
TAFLFSGQGA 
LAFLFSGQGS 
VWGAAPGGA 



RAAVPCGDPD 
RAWPCGGRG 
RAVLLGGDTA 
HAWTGHDRE 
RAWLARGEA 



QHPGMGQELY 
QWVGMGRGLY 
QRIiGMGRELY 
QQPGMGKRLH 
GAAGGAGAAG 



RLSSAIiAALA 
ELVAALGGFA 
ELLGSLDALA 
DLLNGLRDLE 
ELAGRLRAIiA 



TTDPHFAAAL 
AGGGVFAEVL 
AVFPVFADAIi 
QVFPGFRDAL 
GAGGGGWLV 



600 

AGQTPRGVRI GS,. TDADGR 
AGRVSGGVRS GR, .A.VPGG 
EGAETASXVR GEAYT. .EGR 
NGLPAPQVLL GRTPTPEPGG 
GGDPDAGWT G. . .AVLDGG 



DEVCEELQRC 
DEVIiSMVGEV 
DEAFAAIiDVH 
DEVCAELDTH 
FPGQGTQWVG 



650 

GTQNLREVMF 
DGRSLRDVMF 
LDRPLREIVL 

LGRLL 

MGAGIiLGSSE 



651 700 

TPDQPD LLDRTEYTQP ALFALQTALY 

GDVDVDAGAG ADAGAGAGAG VGSGSGSVGG LLGRTEFAQP ALFALEVALF 

GETDSGGNVS GENVIGEGA DHQA LLDQTAYTQP ALFAIETSLY 

.GPEAGPPLR DVMFAERGT AHSA LLSETHYTQA ALFALETALF 

VFAASMRECA RALSVHVGWD LLEWSGGAG . LERVDWQP VTWAVMVSLA 



701 

RTLTARGTQA 
RALEARGVEV 
RLAASFGLKP 
RLLVQWGLKP 
RYWQAMGVDV 



HLVLGHSVGE 
SWLGHSVGE 
DYVLGHSVGE 
DHLAGHSVGE 
AAWGHSQGE 



ITAAHIAGVL 
VAAATVAGVL 
lAAAHVAGVL 
lAAAHAAGIL 
lAAATVAGAL 



DLPDAARLIT 
SLGDAVRLW 
SLPDASALVA 
DLSDAAELVA 
SLEDAAAWA 



Fiq 



^ 750 
ARAHVMGQLP 
ARGGLMGGLP 
TRGRLMQAVR 
TRGALMRSLP 
LRAGLIGRYL 
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AAEHDLDQLA HTHG.,VEIA AVNGPTHCVL 
ASESWRGW EGLGEWVSVA AVNGPRSWL 
ATADEAAEQL AGHERHVTVA AVNGPDSVW 
APESEVAPLL LGREAHVGLA AVNGPDAVW 
LPAGEVEAGL .AKWPGVEVA AVNGPASTW 



niddam. . 
platenol 
monensin 
oleandom 
tylosin 



niddam- . 
platenol 
monensin 
oleandom 
tylosin 

niddeim. . 
platenol 
monensin 
oleandom 
tylosin 

niddam. . 
platenol 
monensin 
oleandom 
tylosin 

niddcim. . 
platenol 
monensin 
oleandom 
tylosin 



751 

HG.GAMLSVQ 
VG.GGMWSVG 
AP.GAMAAWQ 
GG.GVMLSVQ 
AGRGAMAAVP 

801 

AQHLREQNVR 
VASLMGDGVE 
TAAWRGRGRK 
EQILRDRGRK 
VAVCQAEGVQ 



951 

EPETLTQAIA 
EREVFEAALA 
EVATFLRSLA 
EARSLTEAVA 
SPHRLLTSTA 



HTWLKVSHAF 
YRRLDVSHGF 
AHHLKVSHAF 
SRYLRVSHAF 
ARLIPVDYAS 



HSALMDPMLG 
HSVLMEPVLG 
HSPHMDPILD 
HSPLMEPVLE 
HSRHVEDLKG 



AFRDTLNTLN 
EFRGWESLE 
ELRAVAAGLT 
EFAEAVAGLT 
ELERVLSGI . 



800 

SG PRTALEET 
SGDVGVLESV 
SGDRATVDEL 
SGERGHVAAI 
SGDRRAVAGY 

850 

y. .QPPTIPL 
FGRVRPGVW 
FHE. .PVIPV 
FRA, .PTTPL 
.RPRSPRVPV 



851 900 

ISNLTGQIA DPNHL CTPDYWIDHA RHTVRFADAV QTAHHQGTTT 

VSGVSGGW GSGEL GDPGYWVRHA REAVRFADGV GWRGLGVGT 

VSNVTGELVT ATATGSGAGQ ADPEYWARHA REPVRFLSGV RGLCERGVTT 

VSNLTG APVDDRTM ATPAYWVRHV REAVRFGDGI RALGKLGTGS 

CSTVAGEQPG EPVF DAGYWFRNL RNRVEFSAW GGLLEEGHRR 

901 950 

YLEIGPHPTL TTLLHHTL, . .DNP T TIPTLHRERP 

LVEVGPHGVL TGMAGECLGA GDDV V WPAMRRGRA 

FVEbGPDAPL SAMARDCFPA P ADRSRPRPA AIATCRRGRD 

FLEVGPDGVLi TAMARACVTA APEPGHRGEQ GADADAHTAL LLPALRRGRD 
FIEVSAHPVL V HAIEQ TAEAADRSVH ATGTLRRQDD 



AVGVRTDGID WAVLCGASRP RRVELPTYAF 
TVFTRDAGLD ATALHTGSTG RRIDLPTTPF 
QAYVRGADVD FTRAYGATAT RRFPLPTYPF 
RLHLHGVPMD WTSVLGGDVS .RVPLPTYAF 
EAWAHGATLT WDPAL..PPG HLTTLPTYPF 



niddam: niddamycin; platenol: platenolide I (spiramycin); oleandom: 
oleandonr/cin . 



FIG . 4C 



SUBSTITUTE SHEET (RULE 26) 



wo 00/00500 



10/12 



PCT/GB99/02042 




SUBSTITUTE SHEET (RULE 26) 



wo 00/00500 



11/12 



PCT/GB99/02042 




Fig 6 

SUBSTITUTE SHEET (RULE 26) 

BNSDOCID: <WO 0000500A2J_> 



wo 00/00500 



12/12 



PCT/GB99/02042 



o 
u 

tn 

•H 





U 


m 




o 


Eh 






£h 




o 


O 










O 






u 






















o 




o 






o 




o 


o 






< 










< 






GG 


u 




o 




o 






u 






o 






o 




e) 


o 




o 






o 




ft 






fL 


(J 


o 




8 


e> 


'd 






u 


< 




(d 








U 






1 




o 




o 


U-l 


in 


o 



o 

CP 

u 

o 
u 
o 

o 
u 
o 

t- 
o 
< 

u 

*< 
o 
«< 

o 

O 



1 

u 
u 

u 
< 



I 

a 

8 

< 

u 



CO 



1 Cn 





AC 


8 


o 


o 


O 


»-( 


8 




«J 






0) 


O 






u 






u 


Id 


8 


u 


Ul 


CC 


GG 


o 




u 




i 










«-« 






o 







5 



SUBSTITUTE SHEET (RULE 26) 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 



PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) Internationa) Patent Classification 7 
C07H 17/08, A61K 31/70 



A3 



(11) International Publication Number: WO 00/00500 

(43) International Publication Date: 6 January 2000 (06.01.00) 



(21) International Application Number: PCT/GB 99/02042 

(22) International Filing Date: 29 June 1999 (29.06.99) 



(30) Priority Data: 
9814006.4 



29 June 1998 (29.06.98) 



GB 



(71) Applicants (for all designated States except US): BIOTICA 

TECHNOLOGY LIMITED [GB/GB]; 1 12 Hilis Road, Cam- 
bridge CB2 IPH (GB). PFIZER INC. [US/US]; 235 East 
42nd Street, New York, NY 10017 (US). 

(72) Inventors; and 

(75) Inventors/Applicants (/br US only): LEADLAY, Peter, Francis 
[GB/GB]; 17 Clarendon Road, Cambridge CB2 2BH (GB). 
STAUNTON, James [GB/GB]; 29 Person Road, Cambridge 
CB2 ET (GB). CORTES, Jesus [GB/GB]; 26 Cambanks, 
Union Lane, Cambridge CB4 IPZ (GB). MCARTHUR, 
Hamish, Alastair, Irvine [GB/USJ; 19 Pheasant Run Drive, 
Gales Ferry, CT 06335 (US). 



(81) Designated States: AE, AL, AM, AT, AU. AZ, BA. BB, BG. 
BR, BY, CA, CH, CN. CU. CZ, DE, DK, EE, ES, FI, GB, 
GD, GE, GH, GM, HR, HU, ID. IL, IN, IS, JP, KE. KG, 
KP, KR, KZ, LC, LK, LR, LS, LT, LU, LV, MD, MG, MK, 
MN, MW, MX, NO, NZ, PL, PT, RO, RU. SD, SE, SG, SI, 
SK, SL, TJ, TM, TR, TT, UA, UG, US, UZ, VN, YU, ZA, 
ZW, ARIPO patent (GH, GM, KE, LS, MW, SD, SL, SZ, 
UG, ZW), Eurasian patent (AM, AZ, BY, KG. KZ, MD, 
RU, TJ, TM), European patent (AT. BE, CH, CY, DE, DK, 
ES, FI, FR, GB, GR, IE, IT, LU, MC, NL, PT. SE), OAPI 
patent (BF. BJ, CF, CG. CI. CM, GA. GN, GW, ML. MR. 
NE, SN, TD. TG). 



Published 

With international search report. 

Before the expiration of the time limit for amending the claims 
and to be republished in the event of the receipt of amendments, 

(88) Date of publication of the international search report: 

16 March 2000(16.03.00) 



(74) Agents: STUART, Ian et aL; Mewbum Ellis, York House, 23 
Kingsway, London WC2B 6HP (GB). 



(54) Title: POLYKETIDES AND THEIR SYNTHESIS 



(57) Abstract 



A polyketide synthase ("PKS") of Type I is a complex multienzyme including a loading domain linked to a multiplicity of extension 
domains. Ths first extension module receives an acyl starter unit from the loading domain and each extension module adds a further ketide 
unit which may undergo processing (e.g. reduction). We have found that the Ksq domain possessed by some PKS's has decarboxylating 
activity, e.g. generating (substituted) acyl from (substituted) malonyl. The CLF domain of type II PKS's has similar activity. By inserting 
loading modules including such domains into PKS's not normally possessing them it is possible to control the starter units used. 



BNSDOCID: <WO 0000500A3J_> 



FOR THE PURPOSES OF INFORMATION ONLY 



Codes used to identify States party to the PCX on the front pages of pamphlets publishing international applications under the PCT. 



AL 


Albania 


ES 


Spain 


LS 


Lesotho 


SI 


Slovenia 


AM 


Armenia 


FI 


Finland 


LT 


Lithuania 


SK 


Slovakia 


AT 


Austria 


FR 


France 


LU 


Luxembourg 


SN 


Senegal 


AU 


Australia 


GA 


Gabon 


LV 


Latvia 


sz 


Swaziland 


AZ 


Azerbaijan 


GB 


United Kingdom 


MC 


Monaco 


TD 


Chad 


BA 


Bosnia and Herzegovina 


GE 


Georgia 


MD 


Republic of Moldova 


TG 


Togo 


BB 


Barbados 


GH 


Ghana 


MG 


Madagascar 


TJ 


Tajikistan 


BE 


Belgiun) 


GN 


Guinea 


MK 


The former Yugoslav 


TM 


Turkmenistan 


BF 


Burkina Faso 


GR 


Greece 




Republic of Macedonia 


TR 


Turkey 


BG 


Bulgaria 


HU 


Hungary 


ML 


Mali 


TT 


Trinidad and Tobago 


BJ 


Benin 


IE 


Ireland 


MN 


Mongolia 


UA 


Ukraine 


BR 


Brazil 


IL 


Israel 


MR 


Mauritania 


UG 


Uganda 


BY 


Belarus 


IS 


Iceland 


MW 


Malawi 


US 


United States of America 


CA 


Canada 


IT 


Italy 


MX 


Mex ico 


uz 


Uzbekistan 


CF 


Central African Republic 


JP 


Japan 


NE 


Niger 


VN 


Viet Nam 


CG 


Congo 


KE 


Kenya 


NL 


Netherlands 


YU 


Yugoslavia 


CH 


Switzerland 


KG 


Kyrgyzstan 


NO 


Norway 


ZW 


Zimbabwe 


CI 


C6te d'lvoire 


KP 


Democratic People's 


NZ 


New Zealand 






CM 


Cameroon 




Republic of Korea 


PL 


Poland 






CN 


China 


KR 


Republic of Korea 


PT 


Portugal 






cu 


Cuba 


KZ 


Kazakstan 


RO 


Romania 






cz 


Czech Republic 


LC 


Saint Lucia 


RU 


Russian Federation 






DK 


Gennany 


LI 


Liechtenstein 


SD 


Sudan 






DK 


Denmark 


LK 


Sri Lanka 


SE 


Sweden 






EE 


Estonia 


LR 


Liberia 


SG 


Singapore 







INTERNATIONAL SEARCH REPORT 



Interr lal Application No 

PCT/GB 99/02042 



A. CLASSiFICATION OF SUBJECT MATTER 

IPC 7 C07H17/08 A61K31/70 



According to International Patent Classification (IPC) or to both national class It icat ton artd iPC 



B. FIELDS SEARCHED 



Minimum documentation searched (classihcation system followed by classification symbols) 

IPC 7 C07H A61K 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the International search (name of data base and. where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ' Citation of document, with indication, where appropriate, of the relevant passages 



WO 91 16334 A (ABBOTT LAB) 

31 October 1991 (1991-10-31) 

the whole document, but especially page 7, 

6-Deoxy-15-norerythromyci ns A-D. More 

especially 6-Deoxy-15-norerythromyc1 ns A 

and C 

-/- 



Relevant to claim No. 



1-9 



Further documents are listed in the continuation of box C. 



Patent family members are listed In annex. 



Special categories of cited documents : 

■*A" document defining the general state of the art which is not 

considered lo be of particular relevance 
"E" earlier document but published on or after the international 

filing date 

■*L" document v/hich may throw doubts on priority claim(s) or 
'.vhich ic died io establioh tf-.s publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



"T" later document published alter the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular refevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"V aocurneni oi parucuiar relevance; the ciaimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

"&" document member of the same patent family 



Date of the actual completion of the international search 



5 January 2000 



Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2260 HV Rijswijk 
Tel. (+31-70) 340-2040, Tx. 31 651 epo nl. 
Fax: (+31-70) 340-3016 



Date of mailing of the international search report 



20/01/2000 



Authorized ofticer 



Scott, J 



Form PCT/ISA/210 {second sheet) (July 1992) 
BNSDOCID: <WO O0O05O0A3_L> 



page 1 of 2 



Intern lal Appllcatton No 

PCT/GB 99/02042 



C.(Cominua»ion) DOCUMENTS CONSIDERED TO BE RELEVANT 



2 



Category ■ 


Citation ot document, wrth Indication, where appropriate, of thte relevant passages 




Y 


L.LIN ET AL.: "Biosynthesis of 


1-14 




2-Nor-6-cleoxyerythronolide B by Rationally 






Designed Domain Substitution." 






JOURNAL OF THE AMERICAN CHEMICAL SOCIETY., 






vol . 119, no. 43, 






29 October 1997 (1997-10-29), pages 






10553-10554, XP002121612 






AMERICAN CHEMICAL SOCIETY, WASHINGTON, 






DC. , US 






ISSN: 0002-7863 


1.4,7 


X 


the whole document, but especially 




compound 3 




Y 


WO 98 01546 A (CORTES JESUS ;LEADLAY PETER 


1-17 




F (GB); STAUNTON JAMES (GB); BIOTICA T) 






15 January 1998 (1998-01-15) 






cited in the application 


1,4,7 


y 


t hp whole document 


Y 


WO 98 01571 A (CORTES JESUS ;LEADLAY PETER 


1-17 




F (GB); PACEY MICHAEL S (GB); STAUNTON) 






15 January 1998 (1998-01-15) 






cited in the application 






thp whol p document 




Y 


WO 95 08548 A (UNIV LELAND STANFORD JUNIOR 


1-17 




•JOHN INNES CENTRE (GB)) 






30 March 1995 (1995-03-30) 






cited in the application 






the whole document 




P X 


WO 98 49315 A (KOSAN BIOSCIENCES INC ;UNIV 


1-17 


LELAND STANFORD JUNIOR (US)) 






5 November 1998 (1998-11-05) 






the whole document, but especially claims 






1-28 and figure 6/table 2, all compounds 






containing "nor" 




E 


WO 99 35157 A (WU YONG JIN ;PFIZER PROD 


1,4,7 




INC (US)) 15 July 1999 (1999-07-15) 






the whole document 





Form PCT/ISA/210 (continuation ot second sheet) (July 1992) 



Dane 2 of 7 



INTERNATIONAL SEARCH REPORT 

information on patent family members 



Intern tal Application No 

PCT/GB 99/02042 



Patent document 
cited in search report 



Publication 
date 



Patent family 
member(s) 



Publication 
date 



US 


5141926 


A 


25-08-1992 


CA 


2080583 


A 


19-10-1991 


EP 


0525083 


A 


03-02-1993 


JP 


2587562 


B 


05-03-1997 


JP 


5504890 
%j \j " \~) J \j 


T 


29-07-1993 


KR 


9608668 


B 


28-06-1996 


PT 


97390 


A 


31-01-1992 


All 


3450997 


A 


02-02-1998 

wL. w<» X-7^U 


All 


3451497 


A 


02-02-1998 

V C w£_ A JJt* 


PA 


??5Q4?n 


A 


15-01-1998 


PA 




A 


15-01-1998 


CN 


1229438 


A 


22-09-1999 


EP 


0909327 


A 


21-04-1999 


EP 


0910633 


A 


28-04-1999 


WO 


9801571 


A 


15-01-1998 


GB 


2331518 


A 


26-05-1999 


NO 


990012 


A 


23-02-1999 


PL 


331285 


A 


05-07-1999 


AU 


7666198 


A 


30-12-1998 


WO 


9854308 


A 


03-12-1998 



WO 9116334 



31-10-1991 



WO 9801546 



15-01-1998 



WO 9801571 A 15-01-1998 



AU 


3450997 


A 


02- 


-02- 


1998 


AU 


3451497 


A 


02- 


-02- 


1998 


CA 


2259420 


A 


15- 


-01- 


1998 


CA 


2259463 


A 


15- 


-01- 


1998 


CN 


1229438 


A 


22- 


-09- 


1999 


EP 


0909327 


A 


21- 


-04- 


1999 


EP 


0910633 


A 


28- 


-04- 


1999 


WO 


9801546 


A 


15- 


-01- 


1998 


GB 


2331518 


A 


26- 


-05- 


1999 


NO 


990012 


A 


23- 


-02- 


1999 


PL 


331285 


A 


05- 


-07- 


1999 


AU 


7666198 


A 


30- 


-12- 


1998 


WO 


9854308 


A 


03- 


-12- 


1998 



WO 


9508548 


A 


30- 


-03-1995 


US 


5672491 


A 


30-09-1997 












AU 


678058 


B 


15-05-1997 












AU 


7731794 


A 


10-04-1995 












CA 


2171629 


A 


30-03-1995 












EP 


0725778 


A 


14-08-1996 












JP 


9505983 


T 


17-06-1997 












US 


5712146 


A 


27-01-1998 












US 


5830750 


A 


03-11-1998 












US 


5843718 


A 


01-12-1998 












US 


5962290 


A 


05-10-1999 


WO 


9849315 


A 


05- 


-11-1998 


AU 


7172298 


A 


24-11-1998 


WO 


9935157 


A 


15- 


-07-1999 


AU 


1501599 


A 


26-07-1999 



Form PCT/lSA/210 (patent lamUy annex) (July 1992) 
BNSDOCID: <WO 0000500A3J_> 



THIS PAGE BIAMK«)8PT0) 



